SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: (Other than US) AMRAD OPERATIONS PTY LTD

(US Only)

(ii) TITLE OF INVENTION: THERAPEUTIC AND DIAGNOSTIC AGENTS

(iii) NUMBER OF SEQUENCES: 49

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE

(B) STREET: 1 LITTLE COLLINS STREET

(C) CITY: MELBOURNE

(D) STATE: VICTORIA

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 30001 



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT INTERNATIONAL

(B) FILING DATE: 31-OCT-1997 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: PO 5117

(B) FILING DATE: 04-FEB-1997

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PO 3384 

(B) FILING DATE: 01-NOV-1996

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: HUGHES DR, E JOHN L

(C) REFERENCE/DOCKET NUMBER: EJH/EK

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: +61 3 9254 2777

(B) TELEFAX: +61 3 9254 2770

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CACGCCGCCC ACGTGAAGGC 20



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

TTCGCCAATG ACAAGACGCT 20



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1236 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..636 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CGAGGCTCAA GCTCCGGGCG GATTCTGCGT GCCGCTCTCG CTCCTTGGGG TCTGTTGGCC 
GGCCTGTGCC ACCCGGACGC CCGGCTCACT GCCTCTGTCT CCCCCATCAG CGCAGCCCCG 
GACGCTATGG CCCACCCCTC CAGCTGGCCC CTCGAGTAGG 



ATG GTA GCA CGC AAC CAG GTG GCA GCC GAC AAT GCG ATC TCC CCG GCA
Met Val Ala Arg Asn Gln Val Ala Ala Asp Asn Ala Ile Ser Pro Ala
1 5 10 15

GCA GAG CCC CGA CGG CGG TCA GAG CCC TCC TCG TCC TCG TCT TCG TCC
Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser
20 25 30

TCG CCA GCG GCC CCC GTG CGT CCC CGG CCC TGC CCG GCG GTC CCA GCC
Ser Pro Ala Ala Pro Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala
35 40 45

CCA GCC CCT GGC GAC ACT CAC TTC CGC ACC TTC CGC TCC CAC TCC GAT
Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp



50 

TAC CGG C4 
Tyr Arg Ai 
65 

TAT TGG 
Tyr Trp 



55 



60 



ATC ACG CGG ACC AGC GCG CTC CTG GAC GCC TGC GGC TTC 
Ile Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe
70 75 80 

CCC CTG AGC GTG CAC GGG GCG CAC GAG CGG CTG CGT GCC 
Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala 
85 90 95 



GAG CCC GTG GGC ACC TTC TTG GTG CGC GAC AGT CGT CAA CGG AAC TGC
Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gln Arg Asn Cys
100 105 110

TTC TTC GCG CTC AGC GTG AAG ATG GCT TCG GGC CCC ACG AGC ATC CGC
Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser Ile Arg
115 120 125

GTG CAC TTC CAG GCC GGC CGC TTC CAC TTG GAC GGC AGC CGC GAG ACC
Val His Phe Gln Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Thr
130 135 140

TTC GAC TGC CTT TTC GAG CTG CTG GAG CAC TAC GTG GCG GCG CCG CGC
Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg
145 150 155 160



CGC ATG TTG 
Arg Met Leu 



GAG CTG TGT 
Glu Leu Cys 



GCC CCG CTG CGC CAG CGC CGC GTG CGG CCG CTG CAG
Ala Pro Leu Arg Gln Arg Arg Val Arg Pro Leu Gln
165 170 175

CAG CGC ATC GTG GCC GCC GTG GGT CGC GAG AAC CTG
Gln Arg Ile Val Ala Ala Val Gly Arg Glu Asn Leu
185 190



GCG CGC ATC CCT CTT AAC CCG GTA CTC CGT GAC TAC CTG AGT TCC TTC
Ala Arg Ile Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe
195 200 205 



CCC TTC CAG AT 
Pro Phe Gln Ile
210 



TGA CCGGCTG CCGCTGTGCC GCAGCATTAA GTGGGGGCGC 



CTTATTATTT CTTATTATTA ATTATTATTA TTTTTCTGGA ACCACGTGGG AGCCCTCCCC 
GCCTGGGTCG GAGtJGAGTGG TTGTGGAGGG TGAGATGCCT CCCACTTCTG GCTGGAGACC 
TCATCCCACC TCTCAGGGGT GGGGGTGCTC CCCTCCTGGT GCTCCCTCCG GGTCCCCCCT
GGTTGTAGCA GCTTGTGTCT GGGGCCAGGA CCTGAATTCC ACTCCTACCT CTCCATGTTT
ACATATTCCC AGTSTCTTTG CACAAACCAG GGGTCGGGGA GGGTCTCTGG CTTCATTTTT
CTGCTGTGCA GAATATCCTA TTTTATATTT TTACAGCCAG TTTAGGTAAT AAACTTTATT
ATGAAAGTTT TTTTTTAAAA GAAAAAAAAA AAAAAAAAA

192 240 288 336 384 432 480 528 576 624 676 736 796 656 916 976 1036 1075

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 212 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



Met Val Ala Arg Asn Gln Val Ala Ala Asp Asn Ala Ile Ser Pro Ala
1 5 10 15

Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser
20 25 30

Ser Pro Ala Ala Pro Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala
35 40 45

Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp
50 55 60

Tyr Arg Arg Ile Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe
65 70 75 80

Tyr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala
85 90 95

Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gln Arg Asn Cys
100 105 110

Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser Ile Arg
115 120 125

Val His Phe Gln Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Thr
130 135 140

Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg
145 150 155 160

Arg Met Leu Gly Ala Pro Leu Arg Gln Arg Arg Val Arg Pro Leu Gln
165 170 175

Glu Leu Cys Arg Gln Arg Ile Val Ala Ala Val Gly Arg Glu Asn Leu
180 185 190

Ala Arg Ile Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe
195 200 205

Pro Phe Gln Ile
210

(2) INFORMATION FOR SEQ ID NO: 5:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 223..819

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCGATCTGTG GGTGkCAGTG TCTGCGAGAG ACTTTGCCAC ACCATTCTGC CGGAATTTGG 



AGAAAAAGAA CCAG 
ACACTCTCGT Tl 



ECGCTT 



CCAGTCCCCT CCCCCTCCGC CAC CATTTCG GACACCCTGC 
TAC CCTGTGACTT CCAGGCAGCA CGCGAGGTCC ACTGGCCCCA 



GCTCGGGCGA CCAGCTGTCT GGGACGTGTT GACTCATCTC CC ATG ACC CTG CGG 

Met Thr Leu Arg 
1 

TGC CTG GAG CCC TCC GGG AAT GGA GCG GAC AGG ACG CGG AGC CAG TGG
Cys Leu Glu Pro Ser Gly Asn Gly Ala Asp Arg Thr Arg Ser Gln Trp



10 



15 



20 



GGG ACC GCG GGG 
Gly Thr Ala Gly LeV 



GCG AAA GCC CTG 
Ala Lys Ala Leu Arc 
40 



CCG GAG GAA CAG TCC CCC GAG GCG GCG CGT CTG 
Pro Glu Glu Gin Ser Pro Glu Ala Ala Arg Leu 
30 35 

GAG CTC AGT CAA ACA GGA TGG TAC TGG GGA AGT 
Glu Leu Ser Gin Thr Gly Trp Tyr Trp Gly Ser 
45 50 



ATG ACT GTT AAT GAA GCC AAA GAG AAA TTA AAA GAG GCT CCA GAA GGA
Met Thr Val Asn Glu Ala Lys Glu Lys Leu Lys Glu Ala Pro Glu Gly
55 60 65

ACT TTC TTG ATT AGA GAT AGT TCG CAT TCA GAC TAC CTA CTA ACT ATA 



Thr Phe Leu lie Arg 
70 

TCC GTT AAG ACG TCA 
Ser Val Lys Thr Ser 
85 

GAT GGG AAA TTC AGA 
Asp Gly Lye Phe Arg L^ 
105 

AAA CAG TTT GAC AGT G1 
Lys Gin Phe Asp ser va 
120 

TGC AAG GAT AAA CGG A* 
Cys Lys Asp Lys Arg Thjt 
135 



CAC CTG TAC CTG ACC 
His Leu Tyr Leu Thr 
150 



Lyfe 



sp Ser Ser His Ser Asp Tyr Leu Leu Thr lie 
75 80 

:T GGA CCG ACT AAC CTG CGG ATT GAG TAC CAA 
La Gly Pro Tnr Asn Leu Arg He Glu Tyr Gin 
30 95 100 

GAT TCT ATC ATA TGT GTC AAG TCC AAG CTT 
Asp Ser He He Cys Val Lys Ser Lys Leu 
110 115 

GTT CAT CTG ATT GAC TAC TAT GTC CAG ATG 
Val His Leu He Asp Tyr Tyr Val Gin Met 
125 130 

GGC CCA GAA GCC CCA CGG AAT GGG ACT GTT 

Gly Pro Glu Ala Pro Arg Asn Gly Thr Val 
140 145 

CCT CTG TAT ACA TCA GCA CCC ACT CTG CAG 
Pro Leu Tyr Thr Ser Ala Pro Thr Leu Gin 
155 160 



CAT TTC TGT CGA CTC GCC ATT AAC AAA TGT ACC GGT ACG ATC TGG GGA 



His Phe Cys Arg Leu Ali 
165 17< 

CTG CCT TTA CCA ACA AC 
Leu Pro Leu Pro Thr Ar^ 
135 



He Asn Lys Cys Thr Gly Thr He Trp Gly 
175 180 

CTA AAA GAT TAC TTG GAA GAA TAT AAA TTC 
Leu Lys Asp Tyr Leu Glu Glu Tyr Lys Phe 
190 195 



CAG GTA TAAGTATTTC TCTqTCTTTT TCGTTTTTTT TTAAAAAAAA AAAAACACAT 
Gin val 



60 
120 
180 
234 

282 

330 

378 

426 

474 

S22 

570 

618 

666 

714 

762 

810 

866 



GCCTCATATA GACTATCTCC GAATGCAGCT ATGTGAAAGA GAACCCAGAG GCCCTCCTCT 926 
GGATAACTGC GCAGAATTCT CTCTTAAGGA CAGTTGGGCT CAGTCTAACT TAAAGGTGTG 986
AAGATGTAGC TAGGTATTTT AAAGTTCCCC TTAGGTAGTT TTAGCTGAAT GATGCTTTCT 1046
TTCCTATGGC \GCTCAAGAT CAAATGGCCC TTTTAAATGA AACAAAACAA AACAAAACAA 1106
AAAAAAAAAA 1121



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Thr Leu Arg Cys Leu Glu Pro Ser Gly Asn Gly Ala Asp Arg Thr
1 5 10 15

Arg Ser Gln Trp Gly Thr Ala Gly Leu Pro Glu Glu Gln Ser Pro Glu
20 25 30

Ala Ala Arg Leu Ala Lys Ala Leu Arg Glu Leu Ser Gln Thr Gly Trp
35 40 45

Tyr Trp Gly Ser Met Thr Val Asn Glu Ala Lys Glu Lys Leu Lys Glu
50 55 60

Ala Pro Glu Gly Thr Phe Leu Ile Arg Asp Ser Ser His Ser Asp Tyr
65 70 75 80

Leu Leu Thr Ile Ser Val Lys Thr Ser Ala Gly Pro Thr Asn Leu Arg
85 90 95

Ile Glu Tyr Gln Asp Gly Lys Phe Arg Leu Asp Ser Ile Ile Cys Val
100 105 110

Lys Ser Lys Leu Lys Gln Phe Asp Ser Val Val His Leu Ile Asp Tyr
115 120 125

Tyr Val Gln Met Cys Lys Asp Lys Arg Thr Gly Pro Glu Ala Pro Arg
130 135 140

Asn Gly Thr Val His Leu Tyr Leu Thr Lys Pro Leu Tyr Thr Ser Ala
145 150 155 160

Pro Thr Leu Gln His Phe Cys Arg Leu Ala Ile Asn Lys Cys Thr Gly
165 170 175

Thr Ile Trp Gly Leu Pro Leu Pro Thr Arg Leu Lys Asp Tyr Leu Glu
180 185 190

Glu Tyr Lys Phe Gln Val
195



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2187 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 18..695



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGCTGCCTCC GTGCGCC ATG GTC ACC CAC AGC AAG TTT CCC GCC GCC GGG 50
Met Val Thr His Ser Lys Phe Pro Ala Ala Gly
1 5 10

ATG AGC CGC CCC CTG GAC ACC AGC CTG CGC CTC AAG ACC TTC AGC TCC 98
Met Ser Arg Pro Leu Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser
15 20 25

AAA AGC GAG TAC CAG CTG GTG GTG AAC GCC GTG CGC AAG CTG CAG GAG 146
Lys Ser Glu Tyr Gln Leu Val Val Asn Ala Val Arg Lys Leu Gln Glu
30 35 40

AGC GGA TTC TAC TGG AGC GCC GTG ACC GGC GGC GAG GCG AAC CTG CTG 194
Ser Gly Phe Tyr Trp Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu
45 50 55

CTC AGC GCC CAG CCC GCG GGC ACC TTT CTT ATC CGC GAC AGC TCG GAC 242
Leu Ser Ala Glu Pro Ala Gly Thr Phe Leu Ile Arg Asp Ser Ser Asp
60 65 70 75

CAG CGC CAC TTC TTC ACG TTG AGC GTC AAG ACC CAG TCG GGG ACC AAG 290
Gln Arg His Phe Phe Thr Leu Ser Val Lys Thr Gln Ser Gly Thr Lys
80 85 90

AAC CTA CGC ATC CAG TGT GAG GGG GGC AGC TTT TCG CTG CAG AGT GAC 338
Asn Leu Arg Ile Gln Cys Glu Gly Gly Ser Phe Ser Leu Gln Ser Asp
95 100 105

CCC CGA AGC ACG CAG CCA GTT CCC CGC TTC GAC TGT GTA CTC AAG CTG 386
Pro Arg Ser Thr Gln Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu
110 115 120

GTG CAC CAC TAC ATG CCG CCT CCA GGG ACC CCC TCC TTT TCT TTG CCA 434
Val His His Tyr Met Pro Pro Pro Gly Thr Pro Ser Phe Ser Leu Pro
125 130 135

CCC ACG GAA CCC TCG TCC GAA GTT CCG GAG CAG CCA CCT GCC CAG GGA 482
Pro Thr Glu Pro Ser Ser Glu Val Pro Glu Gln Pro Pro Ala Gln Ala
140 145 150 155

CTC CCC GGG ACjT ACC CCC AAG AGA GCT TAC TAC ATC TAT TCT GGG GGC 530
Leu Pro Gly Ser Thr Pro Lys Arg Ala Tyr Tyr Ile Tyr Ser Gly Gly
160 165 170

GAG AAG ATT CCG CTG GTA CTG AGC CGA CCT CTC TCC TCC AAC GTG GCC 578
Glu Lys Ile Pro Leu Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala
175 180 185

ACC CTC CAG CAT CTT TGT CGG AAG ACT GTC AAC GGC CAC CTG GAC TCC 626
Thr Leu Gln His Leu Cys Arg Lys Thr Val Asn Gly His Leu Asp Ser
190 195 200

TAT GAG AAA GTb ACC CAG CTG CCT GGA CCC ATT CGG GAG TTC CTG GAT 674
Tyr Glu Lys Val Thr Gln Leu Pro Gly Pro Ile Arg Glu Phe Leu Asp
205 210 215

CAG TAT GAT GCjT CCA CTT TAAGGAGCAA AAGGGTCAGA GGGGGGCCTG 722
Gln Tyr Asp Ala Pro Leu
220 225

GGTCGGTCGQ TCGCCTCTCC TCCGAGGCAC ATGGCACAAG CACAAAAATC CAGCCCCAAC 782

GGTCGGTAGC TCCCAGTGAG CCAGGGGCAG ATTGGCTTCT TCCTCAGGCC CTCCACTCCC 842

GCAGAGTAGA GCTGGCAGGA CCTGGAATTC GTCTGAGGGG AGGGGGAGCT GCCACCTGCT 902 

TTCCCCCCTC CCCCAGCTCC AGCTTCTTTC AAGTGGAGCC AGCCGGCCTG GCCTGGTGGG 962

ACAATACCTT TCACAAGCGG ACTCTCCCCT CCCCTTCCTC CACACCCCCT CTGCTTCCCA 1022 

AGGGAGGTGG GGACACCTCC AAGTGTTGAA CTTAGAACTG CAAGGGGAAT CTTCAAACTT 1082 

TCCCGCTGGA AuTTGTTTGC GCTTTGATTT GGTTTGATCA AGAGCAGGCA CCTGGGGGAA 1142

GGATGGAAGA GAAAAGGGTG TGTGAAGGGT TTTTATGCTG GCCAAAGAAA TAACCACTCC 1202 

CACTGCCCAA CCTAGGTGAG GAGTGGTGGC TCCTGGCTCT GGGGAGAGTG GCAAGGGGTG 1262 

ACCTGAAGAG AGOTATACTG GTGCCAGGCT CCTCTCCATG GGGCAGCTAA TGAAACCTCG 1322 

CAGATCCCTT GCACCCCAGA ACCCTCCCCG TTGTGAAGAG GCAGTAGCAT TTAGAAGGGA 1382

GACAGATGAG GCTGGTGAGC TGGCCGCCTT TTCCAACACC GAAGGGAGGC AGATCAACAG 1442 

ATGAGCCATC TTGGAGCCCA GGTTTCCCCT GGAGCAGATG GAGGGTTCTG CTTTGTCTCT 1502 

CCTATGTGGG GCTAGGAGAC TCGCCTTAAA TGCCCTCTGT CCCAGGGATG GGGATTGGCA 1562 

CACAAGGAGC CAAACACAGC CAATAGGCAG AGAGTTGAGG GATTCACCCA GGTGGCTACA 1622 

GGCCAGGGGA AGTGGCTGCA GGGGAGAGAC CCAGTCACTC CAGCAGACTC CTGAGTTAAC 1682 

ACTGGGAAGA CATTQGCCAG TCCTAGTCAT CTCTCGGTCA GTAGGTCCGA GAGCTTCCAG 1742

GCCCTGCACA GCCCTCCTTT CTCACCTGGG GGGAGGCAGG AGGTGATGGA GAAGCCTTCC 1802 

CATGCCGCTC ACAGGOGCCT CACGGGAATG CAGCAGCCAT GCAATTACCT GGAACTGGTC 1862 

CTGTGTTGGG GAGAAjAcAAG TTTTCTGAAG TCAGGTATGG GGCTGGGTGG GGCAGCTGTG 1922 

TGTTGGGGTG GCTTTTTTCT CTCTGTTTTG AATAATGTTT ACAATTTGCC TCAATCACTT 1982

TTATAftffAAT CCACCTtECAG CCCGCCCCTC TCCCCACTCA GGCCTTCGAG GCTGTCTGAA 2042 

GATGCTTGAA AAACTCAACC AAATCCCAGT TCAACTCAGA CTTTGCACAT ATATTTATAT 2102

TTATACTCAG AAAAGAAACA TTTCAGTAAT TTATAATAAA AGAGCACTAT TTTTTAATGA 2162 

AAAAAAAAAA AAAAAAAAAA AAAAA 2187 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 225 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Val Thr His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu
1 5 10 15

Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser Glu Tyr Gln
20 25 30

Leu Val Val Asn Ala Val Arg Lys Leu Gln Glu Ser Gly Phe Tyr Trp
35 40 45

Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro
50 55 60

Ala Gly Thr Phe Leu Ile Arg Asp Ser Ser Asp Gln Arg His Phe Phe
65 70 75 80

Thr Leu Ser Val Lys Thr Gln Ser Gly Thr Lys Asn Leu Arg Ile Gln
85 90 95

Cys Glu Gly Gly Ser Phe Ser Leu Gln Ser Asp Pro Arg Ser Thr Gln
100 105 110

Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu Val His His Tyr Met
115 120 125

Pro Pro Pro Gly Thr Pro Ser Phe Ser Leu Pro Pro Thr Glu Pro Ser
130 135 140

Ser Glu Val Pro Glu Gln Pro Pro Ala Gln Ala Leu Pro Gly Ser Thr
145 150 155 160

Pro Lys Arg Ala Tyr Tyr Ile Tyr Ser Gly Gly Glu Lys Ile Pro Leu
165 170 175

Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala Thr Leu Gln His Leu
180 185 190

Cys Arg Lys Thr Val Asn Gly His Leu Asp Ser Tyr Glu Lys Val Thr
195 200 205

Gln Leu Pro Gly Pro Ile Arg Glu Phe Leu Asp Gln Tyr Asp Ala Pro
210 215 220

Leu
225



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ctccggctgg cccctJtctgt AGGATGGTAG CACACAACCA QGTGGCAGCC GACAATGCAG 60 

TCTCCACAGC AGCAGAGCCC CGACGGCGGC CAGAACCTTC CTCCTCTTCC TCCTCCTCGC 120

CCGCGGCCCC CGCGCGCCCG CGGCCGTGCC CCGCGGTCCC GGCCCCGGCC CCCGGCGACA 180 

CGCACTTCCG CACATTCCGT TCGCACGCCG ATTACCGGCG CATCACGCGC GCCAGCGCGC 240 

TCCTGGACGC CTGCGOATTC TACTGGGGGC CCCTGAGCGT GChCGGGGCG CACGAGCGGC 300 

TGCGCGCCGA GCCCGTOGGC ACCTTCCTGG TGCGCGACAG CCGCCAGCGG AACTGCTTTT 360

TCGCCCTTAG CGTGAAGATG GCCTCGGGAC CCACGAGCAT CCGCGTGCAC TTTCAGGCCG 420 

GCCGCTTTCA CCTGGATGGC AGCCGCGAGA GCTTCGACTG CCTCTTCGAG CTGCTGGAGC 480

ACTACGTGGC GGCGCCGCGC CGCATGCTGG 540

CGCTGCAGGA GCTGTGCCGC CAGCGCATCG TGGCCACCGT 600

GCATCCCCCT CAAGCCCGTC CTCCGCGACT ACCTGAGCTC CTTCCCCTTC CAGATTTGAC 660

CGGCAGCGCC CGCQGTGCAC GCAGCATTAA CTGGGATGCC GTGTTATTTT GTTATTACTT 720

GCCTGGAACC ATGTCGGTAC CCTCCCCGGC CTGGGTTGGA GGGAGCGGAT GGGTGTAGGG 780

GCGAGGCGCC TCCCGCCCTC GGCTGGAGAC GUGGCCGCAG ACCCCTTCTC ACCTCTTGAG 840

GGGGTCCTCC CCCTqCTGGT GCTCCCTCTG GGTCCCCCTG GTTGTTGTAG CAGCTTAACT 900

GTATCTGGAG CCAGGACCTG AACTCGCACC TCCTACCTCT TCATGTTTAC ATATACCCAG 960

TATCTTTGCA CAAACCAGGG GTTGGGGGAG GGTCTCTGGC TTTATTTTTC TGCTGTGCAG 1020

AATCCTATTT TATAT"ffTTTT AAAGTCAGTT TAGGTAATAA ACTTTATTAT GAAAGTTTTT 1080

TTTTTTAAAA AAAA 1094



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 211 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Ala His Asn Gln Val Ala Ala Asp Asn Ala Val Ser Thr Ala
1 5 10 15

Ala Glu Pro Arg Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser
20 25 30

Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Ala Val Pro Ala Pro
35 40 45

Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ala Asp Tyr
50 55 60

Arg Arg Ile Thr Arg Ala Ser Ala Leu Leu Asp Ala Cys Gly Phe Tyr
65 70 75 80

Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala Glu
85 90 95

Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gln Arg Asn Cys Phe
100 105 110

Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser Ile Arg Val
115 120 125

His Phe Gln Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Ser Phe
130 135 140

Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg Arg
145 150 155 160

Met Leu Gly Ala Pro Leu Arg Gln Arg Arg Val Arg Pro Leu Gln Glu
165 170 175

Leu Cys Arg Gln Arg Ile Val Ala Thr Val Gly Arg Glu Asn Leu Ala
180 185 190

Arg Ile Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe Pro
195 200 205

Phe Gln Ile
210



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2807 base pairs

(B) TYPE: nucleic acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GGAAACCGAG GCGGGGAGAC CAGGAGGCCT TGGCCTCAGA GCTTCAGAGT CGCGTGGCAG 60 

CAAACAGAGA AACCTGTAGA GGGCAGTGTG CGTCACTTAG CTCAGGGAAG CTGCACGCGA 120 

AACTCACCCG CCTTCATTCA TAAACATCGT CAGCTAGGCA CCTACTCCTG GGCTTTCAGG 180 

ACAAACTGAA TCACGAAACC ACAGTGTCCT TAAAATAGGT CTGACCGCCT GAATCCCTGG 240 

CCAAGGTGTG TACGGGGCAT GGGAGCCCTT GTGCAGAGAT GCTTGCAGGA GCCTTGAGGG 300 

GCTCTGTAAG ACAGAGGCTA GGAAGACAAA GTTGGGGGCT ACAGCTTCTT GTCCTGCCCG 360

GGGCCTCAGT TTCTTCGGTT GCCCACGTAG GAGTGCAGAG AGTCCAGCCC CTGGGGACCC 420 

AACCCAACCC CGCCCAGTTT CCGAGGAACT CGTCCGGGAG CGGGGGCGCC CCTCCCGCAC 480

CGCCTTAGGC TTCCTTTGAA GCCTCTGCGG TCAGGCCACC GCTTCCTGGG AAGCCCAAGC 540

CAAGGCCAGG CCGAGTGGCC AACGGGAGGG GCCCGCGCGC GATTCTGGAG GAGGGCGGCG 600

GCCCCACAGG TCTCCAGGGC TGGCTAGCCG GGCTCCTAGA GCGGAGACTG CCAAGGCCTT 660 

CGGGTCCTGG GCAGGAAGGA TCCTGGCAGG GAGGAGTTGC TTGGGGGGTG GGGGGGAAAG 720

GCTCCAGGCG CGGTGGAGCT CTGACCAGGA GAATGCACAC ACTCGGAGGG GAGGAGGCGT 780

GTCAGCCCCA AGCTAGCATC CCACCCGGGG AGCAGCGATG TGGGGCGAAG GTAGCCAGAG 840

CAAAAGAGCA GGCACCAGGT GACACGAAAC AGAAGATTCC GGGTAGAGCC AGAACCCCAG 900

AAGTCCCATT CAGGGAAGGT GCGAGGCGAG AACGAGTTAG GTGGACCCTC TCCAGGGGCA 960

GCCAAAGAAA TCTAAAGAGA ACCCGAAGGA CTTGCCGGAA AGAGAAACCG AAAGCGGCGG 1020 

TGGGCGGGAT CGGTGGGCGG GGCCTCCCTG GTTTAAGAGC TTGATGCAGG GGCGGGCAGC 1080 

AGCAGAGAGA ACTGCGGCCG TGGCAGCGGC ACGGCTCCCG GCCCCGGAGC ATGCGCGACA 1140 

GCAGCCCCGG AACCCCCAGC CGCGGCGCCC CGCGTCCCGC CGCCAGGTGA GCCGAGGCAG 1200 

CTGCGAAGGA GCAGGCGGGA GGGGATGGGA GGAAGGGGAG CAGAGCCTGG CAGGACTATC 1260

CTCGCAGACT GCATGGCGGG GTCGTGGATG CTATGCCTCT GGCGCCCGCC CCACCGGCTG 1320 

GCCCAGGCGG CCCCTCGCGC GCGCGGGGCG CCGTCAGCCC CTCCTCTCCG GCCCTGAGCC 1380

CGGATCGTCC GCCCGGGTTC CAGTTCCCGG CGTGGCCAGT AGGCGGCAAC CGCGAGGCGG 1440

CAAGCCACCC AGCGGGGACG GCCTGGAGTC GGGCCCCTCT CCACGCCCCC TTCTCCACGC 1500 

GCGCGGGGAG GCAGGGCTCC ACCGCCAGTC TGGAAGGGTT CCACATACAG GAACGGCCTA 1560

CTTCGCAGAT GAGCCCACCG AGGCTCAGGC TCCGGGCGGA TTCTGCGTGT CACCCTCGCT 1620

CCTTGGGGTC CGCTGGCCGG CCTGTGCCAC CCGGACGCCC GGTTCACTGC CTCTGTCTCC 1680

CCCATCAGCG CAGCCCCGGA CGCTATGGCC CACCCCTCCA GCTGGCCCCT CGAGTAGGAT 1740

GGTAGCACGT AACCAGGTGG AAGCCGACAA TGCGATCTCC CCGGCATCAG AGCCCCGACG 1800

GCGGCCAGAG CCATCCTCGT CCTCGTCTTC GTCCTCGCCG GCGGCCCCGG CGCGTCCCCG 1860

GCCCTGCCCG GTGGTCCCGG CCCCGGCTCC GGGCGACACT CACTTCCGCA CCTTCCGCTC 1920

CCACTCTGAT TACCGGCGCA TCACGCGGAC CAGCGCTCTC CTGGACGCCT GCGGCTTCTA 1980

CTGGGGACCC CTGAGCGTGC ATGGGGCGCA CGAACGGCTG CGTTCCGAAC CCGTGGGCAC 2040

CTTCTTGGTG CGCGACAGTC GCCAGCGGAA CTGCTTCTTC GCGCTCAGCG TGAAGATGGC 2100

TTCGGGCCCC ACGAGCATTC GTGTGCACTT CCAGGCCGGC CGCTTCCACC TGGACGGCAA 2160

CCGCGAGACC TTCGACTGCC TCTTCGAGCT GCTGGAGCAC TACGTGGCGG CGCCGCGCCG 2220

CATGTTGGGG GCCCCACTGC GCCAGCGCCG CGTGCGGCCG CTGCAGGAGC TGTGTCGCCA 2280

GCGCATCGTG GCCGCCGTGG GTCGCGAGAA CCTGGCACGC ATCCCTCTTA ACCCGGTACT 2340 

CCGTGACTAC CTGAGTTCCT TCCCCTTCCA GATCTGACCG GCTGCCGCCG TGCCCGCAGA 2400 

ATTAAGTGGG AGCGCCTTAT TATTTCTTAT TATTAATTAT TATTATTTTT CTGGAACCAC 2460 

GTGGGAGCCC TCCCCGCCTA GGTCGGAGGG AGTGGGTGTG GAGGGTGAGA TCCCTCCCAC 2520 

TTCTGGCTGG AGACCTTATC CCGCCTCTCG GGGGGCCTCC CCTCCTGGTG CTCCCTCCCG 2580

GTCCCCCTGG TTGTAGCAGC TTGTGTCTGG GGCCAGGACC TGAACTCCAC GCCTACCTCT 2640

CCATGTTTAC ATGTTCCCAG TATCTTTGCA CAAACCAGGG GTGGGGGAGG GTCTCTGGCT 2700

TCATTTTTCT GCTGTGCAGA ATATTCTATT TTATATTTTT ACATCCAGTT TAGATAATAA 2760

ACTTTATTAT GAAAGTTTTT TTTTTTAAAG AAACAAAGAT TTCTAGA 2807



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Met Val Ala Arg Asn Gln Val Glu Ala Asp Asn Ala Ile Ser Pro Ala
1 5 10 15

Ser Glu Pro Arg Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser
20 25 30

Ser Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Val Val Pro Ala
35 40 45

Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp
50 55 60

Tyr Arg Arg Ile Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe
65 70 75 80

Tyr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ser
85 90 95

Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gln Arg Asn Cys
100 105 110

Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser Ile Arg
115 120 125

Val His Phe Gln Ala Gly Arg Phe His Leu Asp Gly Asn Arg Glu Thr
130 135 140

Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg
145 150 155 160

Arg Met Leu Gly Ala Pro Leu Arg Gln Arg Arg Val Arg Pro Leu Gln
165 170 175

Glu Leu Cys Arg Gln Arg Ile Val Ala Ala Val Gly Arg Glu Asn Leu
180 185 190

Ala Arg Ile Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe
195 200 205

Pro Phe Gln Ile
210



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 263..1529



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGAATTCCGG GCGGGCTGTG TGAGTCTGTG AGTGGAAGGC GCGCCGGCTC TTTTGTC TGA 60 

GTGTGACCCG GTGGC TTTGT TCCAGGCATT CCGGTGATTT CCTCCGGGCA GTCCGCAGAA 120 

GCCGCAGCGG CCGCCCGCGC TCTCTCTGCA GTCTCCACAC CCGGGAGAGC CTGAGCCCGC 180 

GTCACGCCCC TCAGCCCCCG CTGAGTCCCT TCTCTGTTGT CGCGTCCGAA TCGAGTTCCC 240 

GGAATCAGAC GGTGCCCCAT AG ATG GCC AGC TTT CCC CCG AGG GTT AAC GAG 292
Met Ala Ser Phe Pro Pro Arg Val Asn Glu
1 5 10

AAA GAG ATC GTG AGA TCA CGT ACT ATA GGG GAA CTC TTG GCT CCA GCA 340
Lys Glu Ile Val Arg Ser Arg Thr Ile Gly Glu Leu Leu Ala Pro Ala
15 20 25

GCT CCT TTT GAC AAG AAA TGT GGT GGT GAG AAC TGG ACG GTT GCT TTT 388
Ala Pro Phe Asp Lys Lys Cys Gly Gly Glu Asn Trp Thr Val Ala Phe
30 35 40

GCT CCT GAT GGT TCC TAC TTT GCG TGG TCA CAA GGA TAT CGC ATA GTG 436
Ala Pro Asp Gly Ser Tyr Phe Ala Trp Ser Gln Gly Tyr Arg Ile Val
45 50 55

AAG CTT GTC CCG TGG TCC GAG TGC CGT AAG AAC TTT CTT TTG CAT GGT 484
Lys Leu Val Pro Trp Ser Gln Cys Arg Lys Asn Phe Leu Leu His Gly
60 65 70

TCC AAA AAT GTT ACC AAT TCA AGC TGT CTA AAA TTG GCA AGA CAA AAC 532
Ser Lys Asn Val Thr Asn Ser Ser Cys Leu Lys Leu Ala Arg Gln Asn
75 80 85 90

AGT AAT GGT GGT CAG AAA AAC AAG CCT CCT GAG CAC GTT ATA GAC TGT 580
Ser Asn Gly Gly Gln Lys Asn Lys Pro Pro Glu His Val Ile Asp Cys
95 100 105

GGA GAC ATA GTC TGG AGT CTT GCT TTT GGG TCT TCA GTT CCA GAA AAA 628
Gly Asp Ile Val Trp Ser Leu Ala Phe Gly Ser Ser Val Pro Glu Lys
110 115 120

CAG AGT CGT TGC GTT AAT ATA GAA TGG CAT CGG TTC CGA TTT GGA CAG 676
Gln Ser Arg Cys Val Asn Ile Glu Trp His Arg Phe Arg Phe Gly Gln
125 130 135

GAT CAG CTA CTC CTT GCC ACA GGA TTA AAC AAT GGT CGC ATC AAA ATC 724
Asp Gln Leu Leu Leu Ala Thr Gly Leu Asn Asn Gly Arg Ile Lys Ile
140 145 150

TGG GAT GTA TAT ACA GGA AAA CTC CTC CTT AAT TTG GTA GAC CAC ATT 772
Trp Asp Val Tyr Thr Gly Lys Leu Leu Leu Asn Leu Val Asp His Ile
155 160 165 170

GAA ATG GTT AGA GAT TTA ACT TTT GCT CCA GAT GGG AGC TTA CTC CTT 820 
Glu Met Val Arg Asp Leu Thr Phe Ala Pro Asp Gly Ser Leu Leu Leu 
175 180 185 

GTA TCA GCT TCA AGA GAC AAA ACT CTA AGA GTG TGG GAC CTG AAA GAT 868 
Val Ser Ala Ser Arg Asp Lys Thr Leu Arg Val Trp Asp Leu Lys Asp 
190 195 200 

GAT GGA AAC ATG GTG AAA GTA TTG CGG GCA CAT CAG AAT TGG GTG TAC 916
Asp Gly Asn Met Val Lys Val Leu Arg Ala His Gln Asn Trp Val Tyr
205 210 215

AGT TGT GCA TTC TCT CCC GAC TGT TCT ATG CTG TGT TCA GTG GGC GCC 964
Ser Cys Ala Phe Ser Pro Asp Cys Ser Met Leu Cys Ser Val Gly Ala
220 225 230

AGT AAA GCA GTT TTC CTT TGG AAT ATG GAT AAA TAG ACC ATG ATT AGG 1012
Ser Lys Ala Val Phe Leu Trp Asn Met Asp Lys Tyr Thr Met Ile Arg
235 240 245 250

AAG CTG GAA GGT CAT CAC CAT GAT GTT GTA GCT TGT GAC TTT TCT CCT 1060 
Lys Leu Glu Gly His His His Asp Val Val Ala Cys Asp Phe Ser Pro 
255 260 265 

GAT GGA GCA TTG CTA GCT ACT GCA TCC TAT GAC ACT CGT GTG TAT GTC 1108 
Asp Gly Ala Leu Leu Ala Thr Ala Ser Tyr Asp Thr Arg Val Tyr Val 
270 275 280 

TGG GAT CCA CAC AAT GGA GAC CTT CTG ATG GAG TTT GGG CAC CTG TTT 1156
Trp Asp Pro His Asn Gly Asp Leu Leu Met Glu Phe Gly His Leu Phe
285 290 295

CCC TCG CCC ACT CCA ATA TTT GCT GGA GGA GCA AAT GAC CGA TGG GTG 1204
Pro Ser Pro Thr Pro Ile Phe Ala Gly Gly Ala Asn Asp Arg Trp Val
300 305 310

AGA GCT GTG TCT TTC AGT CAT GAT GGA CTG CAT GTT GCC AGC CTT GCT 1252
Arg Ala Val Ser Phe Ser His Asp Gly Leu His Val Ala Ser Leu Ala
315 320 325 330

GAT GAT AAA ATG GTG AGG TTC TGG AGA ATC GAT GAG GAT TGT CCG GTA 1300
Asp Asp Lys Met Val Arg Phe Trp Arg Ile Asp Glu Asp Cys Pro Val
335 340 345

CAA GTT GCA CCT TTG AGC AAT GGT CTT TGC TGT GCC TTT TCT ACT GAT 1348 
Gln Val Ala Pro Leu Ser Asn Gly Leu Cys Cys Ala Phe Ser Thr Asp
350 355 360 

GGC AGT GTT TTA GCT GCT GGG AGA CAT GAT GGA AGT GTG TAT TTT TGG 1396 
Gly Ser Val Leu Ala Ala Gly Thr His Asp Gly Ser Val Tyr Phe Trp 
365 370 375 

GCC ACT CCA AGG CAA GTC CCT AGC CTT CAA CAT ATA TGT CGC ATG TCA 1444 
Ala Thr Pro Arg Gln Val Pro Ser Leu Gln His Ile Cys Arg Met Ser
380 385 390

ATC CGA AGA GTG ATG TCC ACC CAA GAA GTC CAA AAA CTG CCT GTT CCT 1492
Ile Arg Arg Val Met Ser Thr Gln Glu Val Gln Lys Leu Pro Val Pro
395 400 405 410

TCC AAA ATA TTG GCG TTT CTC TCC TAC CGC GGT TAG A CTGAAGACTG
Ser Lys Ile Leu Ala Phe Leu Ser Tyr Arg Gly *
415 420 

CCTTTCCTGG TAGGCCTGCC AGACAGAGCG CCCTTTACAA GACACACCTC AAGCTTTACC 

TCGTGCCGAA TT 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Ala Ser Phe Pro Pro Arg Val Asn Glu Lys Glu Ile Val Arg Ser
1 5 10 15

Arg Thr Ile Gly Glu Leu Leu Ala Pro Ala Ala Pro Phe Asp Lys Lys
20 25 30

Cys Gly Gly Glu Asn Trp Thr Val Ala Phe Ala Pro Asp Gly Ser Tyr
35 40 45

Phe Ala Trp Ser Gln Gly Tyr Arg Ile Val Lys Leu Val Pro Trp Ser
50 55 60

Gln Cys Arg Lys Asn Phe Leu Leu His Gly Ser Lys Asn Val Thr Asn
65 70 75 80

Ser Ser Cys Leu Lys Leu Ala Arg Gln Asn Ser Asn Gly Gly Gln Lys
85 90 95

Asn Lys Pro Pro Glu His Val Ile Asp Cys Gly Asp Ile Val Trp Ser
100 105 110



Leu Ala Phe Gly Ser Ser Val Pro Glu Lys Gln Ser Arg Cys Val Asn
115 120 125

Ile Glu Trp His Arg Phe Arg Phe Gly Gln Asp Gln Leu Leu Leu Ala
130 135 140

Thr Gly Leu Asn Asn Gly Arg Ile Lys Ile Trp Asp Val Tyr Thr Gly
145 150 155 160

Lys Leu Leu Leu Asn Leu Val Asp His Ile Glu Met Val Arg Asp Leu
165 170 175 

Thr Phe Ala Pro Asp Gly Ser Leu Leu Leu Val Ser Ala Ser Arg Asp 
180 185 190 

Lys Thr Leu Arg Val Trp Asp Leu Lys Asp Asp Gly Asn Met Val Lys 
195 200 205 

Val Leu Arg Ala His Gln Asn Trp Val Tyr Ser Cys Ala Phe Ser Pro
210 215 220 

Asp Cys Ser Met Leu Cys Ser Val Gly Ala Ser Lys Ala Val Phe Leu 
225 230 235 240 

Trp Asn Met Asp Lys Tyr Thr Met Ile Arg Lys Leu Glu Gly His His
245 250 255 

His Asp Val Val Ala Cys Asp Phe Ser Pro Asp Gly Ala Leu Leu Ala 
260 265 270 

Thr Ala Ser Tyr Asp Thr Arg Val Tyr Val Trp Asp Pro His Asn Gly 
275 280 285 

Asp Leu Leu Met Glu Phe Gly His Leu Phe Pro Ser Pro Thr Pro Ile
290 295 300 

Phe Ala Gly Gly Ala Asn Asp Arg Trp Val Arg Ala Val Ser Phe Ser 
305 310 315 320 

His Asp Gly Leu His Val Ala Ser Leu Ala Asp Asp Lys Met Val Arg 
325 330 335 

Phe Trp Arg Ile Asp Glu Asp Cys Pro Val Gln Val Ala Pro Leu Ser
340 345 350

Asn Gly Leu Cys Cys Ala Phe Ser Thr Asp Gly Ser Val Leu Ala Ala
355 360 365

Gly Thr His Asp Gly Ser Val Tyr Phe Trp Ala Thr Pro Arg Gln Val
370 375 380

Pro Ser Leu Gln His Ile Cys Arg Met Ser Ile Arg Arg Val Met Ser
385 390 395 400

Thr Gln Glu Val Gln Lys Leu Pro Val Pro Ser Lys Ile Leu Ala Phe
405 410 415 

Leu Ser Tyr Arg Gly * 
420 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 783 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTGTCTTCCT CCGCAGCGCG AGGCTGGGTA CAGGGTCTAT TGTCTGTGGT TGACTCCGTA 60 

CTTTGGTCTG AGGCCTTCGG GAGCTTTCCC GAGGCAGTTA GCAGAAGCCG CAGCGACCGC 120 

CCCCGCCCGT CTCCTCTGTC CCTGGGCCCG GGAGACAAAC TTGGCGTCAC GCCCTCAGCG 180 

GTCGCCACTC TCTTCTCTGT TGTTGGGTCC GCATCGTATT CCCGGAATCA GACGGTGCCC 240

CATAGATGGC CAGCTTTCCC CCGAGGGTCA ACGAGAAAGA GATCGTGAGA TCACGTACTA 300

TAGGTGAACT TTTAGCTCCT GCAGCTCCTT TTGACAAGAA ATGTGGTCGT GAAAATTGGA 360

CTGTTGCTTT TGCTCCAGAT GGTTCATACT TTGCTTGGTC ACAAGGACAT CGCACAGTAA 420 

AGCTTGTTCC GTGGTCCCAG TGCCTTCAGA ACTTTCTCTT GCATGGCACC AAGAATGTTA 480 

CCAATTCAAG CAGTTTAAGA TTGCCAAGAC AAAATAGTGA TGGTGGTCAG AAAAATAAGC 540 

CTCGTGACAT ATTATAGACT GTGGAGATAT AGTCTGGAGT CTTGCTTTTG GGTCATCAGT 600

TCCAGAAAAA CAGAGTCGCT GTGTAAATAT AGAATGGCAT CGCTTCAGAT TTGGACAAGA 660

TCAGCTACTT CTTGCTACAG GGTTGAACAA TGGGCGTATC AAAATATGGG ATGTATATCA 720

GGAAACTCCT CCTTAACTTG GTAGATCATA CTGAAGTGGT CAGAGATTTA ACTTTTGCTC 780 

CAG 783

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1122 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

CTCTGTATGT CTGAATGAAG CTATAACATT TGCCTTTTTA TTGCAGGTTT TCCTTTGGAA 60

TATGGATAAA TACACCATGA TACGGAAACT AGAAGGACAT CACCATGATG TGGTAGCTTG 120

TGACTTTTCT CCTGATGGAG CATTACTGGC TACTGCATCT TATGATACTC GAGTATATAT 180

CTGGSftTCCA CATAATGGAG ACATTCTGAT GGAATTTGGG CACCTGTTTC CCCCACCTAC 240

TCCAATATTT GCTGGAGGAG CAAATGACCG GTGGGTACGA TCTGTATCTT TTAGCCATGA 300 

TGGACTGCAT GTTGCAAGCC TTGCTGATGA TAAAATGGTG AGGTTCTGGA GAATTGATGA 360 

GGATTATCCA GTGCAAGTTG CACCTTTGAG CAATGGTCTT TGCTGTGCCT TCTCTACTGA 420

TGGCAGTGTT TTAGCTGCTG GGACACATGA CGGAAGTGTG TATTTTTGGG CCACTCCACG 480 

GCAGGTCCCT AGCCTGCAAC ATTTATGTCG CATGTCAATC CGAAGAGTGA TGCCCACCCA 540 

AGAAGTTCAG GAGCTGCCGA TTCCTTCCAA GCTTTTGGAG TTTCTCTCGT ATCGTATTTA 600 

GAAGATTCTG CCTTCCCTAG TAGTAGGGAC TGACAGAATA CACTTAACAC AAACCTCAAG 660 

CTTTACTGAC TTCAATTATC TGTTTTTAAA GACGTAGAAG ATTTATTTAA TTTGATATGT 720

TCTTGTACTG CATTTTGATC AGTTGAGCTT TTAAAATATT ATTTATAGAC AATAGAAGTA 780

TTTCTGAACA TATCAAATAT AAATTTTTTT AAAGATCTAA CTGTGAAAAC ATACATACCT 840

GTACATATTT AGATATAAGC TGCTATATGT TGAATGGACC CTTTTGCTTT TCTGATTTTT 900

AGTTCTGACA TGTATATATT GCTTCAGTAG AGCCACAATA TGTATCTTTG CTGTAAAGTG 960

CAAGGAAATT TTAAATTCTG GGACACTGAG TTAGATGGTA AATACTGACT TACGAAAGTT 1020

GAATTGGGTG AGGCGGGCAA ATCACCTGAG GTCAGCAGTT TGAGACTAGC CTGGCAAACA 1080

TGATGAAACC CTGTCTCTAC TAAAAATACA AAAAAAAAAA AA 1122



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2537 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 422..2029

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

CGGCACGAGC CGGGCTCCGT CCGGAGGAAG CGAGGCTGCG CCGCCGGCCC GGCAGGAGCG 60



GAGGACGGGA GCGCGGGCGG TCGCGCTCGC CCTGTCGCTG ACTGCGCTGC CCCGGCCCAT 120 

CCTTGCCTGG CCGCAGGTGC CCTGGATGAG GCCGCCGCGC GTGTCCCGGC CGCTGAGTGT 180 

CCCCCGCGGT CGCCCGGCGC CTGCCCTCAA GCGGCCGCCT CTCCTTGCCC GGGTCCCCGT 240 

TTTCCCCCGG CGCAGTCCTC CTCCGGTGGG CGCCTCCGCA CCTCGGCGCA GGCGGCACGG 300 



CCCTCGGGCC GGGATGGATC CGCCGGGAAG AGGAAGACAA GCCGGGGCGT TGAGCCCCTG 360

CGCACGGTGC CGCCGCGCGT AGTGGGAGCT TACTCGCAGT AGGCTCTCGC TCTTCTAATC 420

A ATG GAT AAA GTG GGG AAA ATG TGG AAC AAC TTA AAA TAC AGA TGC 466
Met Asp Lys Val Gly Lys Met Trp Asn Asn Leu Lys Tyr Arg Cys
1 5 10 15

CAG AAT CTC TTC AGC CAC GAG GGA GGA AGC CGT AAT GAG AAC GTG GAG 514
Gln Asn Leu Phe Ser His Glu Gly Gly Ser Arg Asn Glu Asn Val Glu
20 25 30

ATG AAC CCC AAC AGA TGT CCG TCT GTC AAA GAG AAA AGC ATC AGT CTG 562
Met Asn Pro Asn Arg Cys Pro Ser Val Lys Glu Lys Ser Ile Ser Leu
35 40 45

GGA GAG GCA GCT CCC CAG CAA GAG AGC AGT CCC TTA AGA GAA AAT GTT 610
Gly Glu Ala Ala Pro Gln Gln Glu Ser Ser Pro Leu Arg Glu Asn Val
50 55 60

GCC TTA CAG CTG GGA CTG AGC CCT TCC AAG ACC TTT TCC AGG CGG AAC 658
Ala Leu Gln Leu Gly Leu Ser Pro Ser Lys Thr Phe Ser Arg Arg Asn
65 70 75

CAA AAC TGT GCC GCA GAG ATC CCT CAA GTG GTT GAA ATC AGC ATC GAG 706
Gln Asn Cys Ala Ala Glu Ile Pro Gln Val Val Glu Ile Ser Ile Glu
80 85 90 95

AAA GAC AGT GAC TCG GGT GCC ACC CCA GGA ACG AGG CTT GCA CGG AGA 754
Lys Asp Ser Asp Ser Gly Ala Thr Pro Gly Thr Arg Leu Ala Arg Arg
100 105 110

GAC TCC TAC TCG CGG CAC GCC CCG TGG GGA GGA AAG AAG AAA CAT TCC 802
Asp Ser Tyr Ser Arg His Ala Pro Trp Gly Gly Lys Lys Lys His Ser
115 120 125

TGT TCC ACA AAG ACC CAG AGT TCA TTG GAT ACC GAG AAA AAG TTT GGT 850
Cys Ser Thr Lys Thr Gln Ser Ser Leu Asp Thr Glu Lys Lys Phe Gly
130 135 140

AGA ACT CGA AGC GGC CTT CAG AGG CGA GAG CGG CGC TAT GGA GTC AGC 898
Arg Thr Arg Ser Gly Leu Gln Arg Arg Glu Arg Arg Tyr Gly Val Ser
145 150 155

TCC ATG CAG GAC ATG GAC AGC GTT TCT AGC CGC GCG GTC GGG AGC CGC 946
Ser Met Gln Asp Met Asp Ser Val Ser Ser Arg Ala Val Gly Ser Arg
160 165 170 175



TCC CTG AGG CAG AGG CTC CAG GAC ACG GTG GGT TTG TGT TTT CCC ATG 994
Ser Leu Arg Gln Arg Leu Gln Asp Thr Val Gly Leu Cys Phe Pro Met
180 185 190

AGA ACT TAG AGC AAG CAG TCA AAG CCA CTC TTT TCC AAT AAA AGA AAA 1042
Arg Thr Tyr Ser Lys Gln Ser Lys Pro Leu Phe Ser Asn Lys Arg Lys
195 200 205

ATA CAT CTT TCT GAA TTA ATG CTG GAG AAA TGC CCT TTT CCT GCT GGC 1090
Ile His Leu Ser Glu Leu Met Leu Glu Lys Cys Pro Phe Pro Ala Gly
210 215 220

TCG GAT TTA GCA CAA AAG TGG CAT TTG ATT AAA CAG CAT ACC GCC CCT 1138
Ser Asp Leu Ala Gln Lys Trp His Leu Ile Lys Gln His Thr Ala Pro
225 230 235

GTG AGC CCA CAC TCA ACA TTT TTT GAT ACA TTT GAT CCA TCA CTG GTG 1186
Val Ser Pro His Ser Thr Phe Phe Asp Thr Phe Asp Pro Ser Leu Val
240 245 250 255

TCT ACA GAA GAT GAA GAA GAT AGG CTT CGC GAG AGA AGA CGG CTT AGT 1234
Ser Thr Glu Asp Glu Glu Asp Arg Leu Arg Glu Arg Arg Arg Leu Ser
260 265 270

ATC GAA GAA GGG GTG GAT CCC CCT CCC AAC GCA CAA ATA CAC ACC TTT 1282
Ile Glu Glu Gly Val Asp Pro Pro Pro Asn Ala Gln Ile His Thr Phe
275 280 285

GAA eeT-ACT GCA CAG GTC AAC CCA TTG TAT AAG CTG GGA CCA AAG TTA 1330
Glu Ala Thr Ala Gln Val Asn Pro Leu Tyr Lys Leu Gly Pro Lys Leu
290 295 300

GCT CCT GGG ATG ACA GAG ATA AGT GGA GAT GGT TCT GCA ATT CCA CAA 1378
Ala Pro Gly Met Thr Glu Ile Ser Gly Asp Gly Ser Ala Ile Pro Gln
305 310 315

GCA ATT GTG ACT CAG AAG AGG ATT CAA CCA CCC TAT GTC TGC AGT CAC 1426
Ala Ile Val Thr Gln Lys Arg Ile Gln Pro Pro Tyr Val Cys Ser His
320 325 330 335

GGA GGC AGA AGC AGC GCC AGG TGT CCG GGG ACA GCC ACG CGC ACC TTA 1474
Gly Gly Arg Ser Ser Ala Arg Cys Pro Gly Thr Ala Thr Arg Thr Leu
340 345 350

GCA GAC AGG GAG CTT GGA AAG TTC ATA CGC AGA TCG ATT ACA TAC ACT 1522
Ala Asp Arg Glu Leu Gly Lys Phe Ile Arg Arg Ser Ile Thr Tyr Thr
355 360 365

GCC TCG TGC CAG ATT TGC TTC AGA TCA CAG GGA ATC CCT GTT ACT GGG 1570
Ala Ser Cys Gln Ile Cys Phe Arg Ser Gln Gly Ile Pro Val Thr Gly
370 375 380

GCG TGA TGG ACC GAT ACG AGG CCG AAG CCC TTC TAG AAG GGA AAC CGG 1618
Ala * Trp Thr Asp Thr Arg Pro Lys Pro Phe * Lys Gly Asn Arg
385 390 395

AAG GCA CGT TCT TGC TCA GGG ACT CTG CAC AGG AGG ACT ACC TCT TCT 1666
Lys Ala Arg Ser Cys Ser Gly Thr Leu His Arg Arg Thr Thr Ser Ser
400 405 410 415

CTG TGA GCT TCC GCC GCT ACA ACA GGT CTC TGC ACG CCC GGA TCG AGC 1714
Leu * Ala Ser Ala Ala Thr Thr Gly Leu Cys Thr Pro Gly Ser Ser
420 425 430

AGT GGA ACC ACA ACT TCA GCT TCG ATG CCC ATG ACC CCT GCG TGT TTC 1762
Ser Gly Thr Thr Thr Ser Ala Ser Met Pro Met Thr Pro Ala Cys Phe
435 440 445

ACT CCT CCA CGT CAC GGG GCT TCT CGA ACA CTA TAA AGA CCC CAG CTC 1810
Thr Pro Pro Arg His Gly Ala Ser Arg Thr Leu * Arg Pro Gln Leu
450 455 460

TTG CAT GTT TTT TGA ACC GTT GCT AAC GAT ATC ACT GAA TAG AAC TTT 1858
Leu His Val Phe * Thr Val Ala Asn Asp Ile Thr Glu * Asn Phe
465 470 475

CCC TTT CAG CCT GCA GTA TAT CTG CCG CGC AGT GAT CTG CAG ATG CAC 1906
Pro Phe Gln Pro Ala Val Tyr Leu Pro Arg Ser Asp Leu Gln Met His
480 485 490 495

TAC GTA TGA TGG GAT TGA CGG GCT CCC GCT ACC GTC GAT GTT ACA GGA 1954
Tyr Val * Trp Asp * Arg Ala Pro Ala Thr Val Asp Val Thr Gly
500 505 510

TTT TTT AAA AGA GTA TCA TTA TAA ACA AAA AGT TAG GGT TCG CTG GTT 2002
Phe Phe Lys Arg Val Ser Leu * Thr Lys Ser * Gly Ser Leu Val
515 520 525

AGA ACG AGA CCA GTC AAA GCA AAG TAACTCCTGT CCCCAAAGGG CACTAACTAA 2056
Arg Thr Arg Pro Val Lys Ala Lys
530 535

GTCTGCTCCT CCCGTGCATC GAACTGCACC CATAGGAGGC AGTCAGCTGC TAGGATTTCC 2116 

CACCCAGAAT GGGAGCTTAG TCATTAGCCT CTGCCCTATG GGGTCCGCTG TTCCTCAGAC 2176 

AAAGGTGCCT AGGGACAGCA AGATGGC TTG CAGGTGTTCG GTGGGCTGTG ACAACTGAGG 2236 

GAGGCAACTC TGCGGCATTT GCTATGAAGA ATTCTATTTC TTACCGAAGA ACAAATTATT 2296 

AATATTGGAT GGGTATTTCA ATAGTGTGAC TAATGTTTGA AATTATTTTT TCTAAGAATT 2356 

TTTCTATAAC CTTCAGAAAA AGTAGTGATG TTTGTAGTTA CTATAAATCA AGOTTTGAAA 2416

GTTCAAAACA AACAAGTTAA ATAAAAGACT ACCTTCCTTT TAGAGAAAAC AAATGCAAGT 2476 

TTTCCCAGCC ACAGGCATTG TGCACTGTTA ATGTTGCTTG TTATCAGCTC CTTTCTCCTC 2536 

C 2537

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 535 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Asp Lys Val Gly Lys Met Trp Asn Asn Leu Lys Tyr Arg Cys Gln
1 5 10 15

Asn Leu Phe Ser His Glu Gly Gly Ser Arg Asn Glu Asn Val Glu Met
20 25 30

Asn Pro Asn Arg Cys Pro Ser Val Lys Glu Lys Ser Ile Ser Leu Gly
35 40 45

Glu Ala Ala Pro Gln Gln Glu Ser Ser Pro Leu Arg Glu Asn Val Ala
50 55 60

Leu Gln Leu Gly Leu Ser Pro Ser Lys Thr Phe Ser Arg Arg Asn Gln
65 70 75 80

Asn Cys Ala Ala Glu Ile Pro Gln Val Val Glu Ile Ser Ile Glu Lys
85 90 95

Asp Ser Asp Ser Gly Ala Thr Pro Gly Thr Arg Leu Ala Arg Arg Asp
100 105 110

Ser Tyr Ser Arg His Ala Pro Trp Gly Gly Lys Lys Lys His Ser Cys
115 120 125

Ser Thr Lys Thr Gln Ser Ser Leu Asp Thr Glu Lys Lys Phe Gly Arg
130 135 140

Thr Arg Ser Gly Leu Gln Arg Arg Glu Arg Arg Tyr Gly Val Ser Ser
145 150 155 160

Met Gln Asp Met Asp Ser Val Ser Ser Arg Ala Val Gly Ser Arg Ser
165 170 175

Leu Arg Gln Arg Leu Gln Asp Thr Val Gly Leu Cys Phe Pro Met Arg
180 185 190

Thr Tyr Ser Lys Gln Ser Lys Pro Leu Phe Ser Asn Lys Arg Lys Ile
195 200 205

His Leu Ser Glu Leu Met Leu Glu Lys Cys Pro Phe Pro Ala Gly Ser
210 215 220

Asp Leu Ala Gln Lys Trp His Leu Ile Lys Gln His Thr Ala Pro Val
225 230 235 240

Ser Pro His Ser Thr Phe Phe Asp Thr Phe Asp Pro Ser Leu Val Ser
245 250 255

Thr Glu Asp Glu Glu Asp Arg Leu Arg Glu Arg Arg Arg Leu Ser Ile
260 265 270

Glu Glu Gly Val Asp Pro Pro Pro Asn Ala Gln Ile His Thr Phe Glu
275 280 285

Ala Thr Ala Gln Val Asn Pro Leu Tyr Lys Leu Gly Pro Lys Leu Ala
290 295 300

Pro Gly Met Thr Glu Ile Ser Gly Asp Gly Ser Ala Ile Pro Gln Ala
305 310 315 320

Ile Val Thr Gln Lys Arg Ile Gln Pro Pro Tyr Val Cys Ser His Gly
325 330 335

Gly Arg Ser Ser Ala Arg Cys Pro Gly Thr Ala Thr Arg Thr Leu Ala
340 345 350

Asp Arg Glu Leu Gly Lys Phe Ile Arg Arg Ser Ile Thr Tyr Thr Ala
355 360 365

Ser Cys Gln Ile Cys Phe Arg Ser Gln Gly Ile Pro Val Thr Gly Ala
370 375 380

* Trp Thr Asp Thr Arg Pro Lys Pro Phe * Lys Gly Asn Arg Lys
385 390 395 400

Ala Arg Ser Cys Ser Gly Thr Leu His Arg Arg Thr Thr Ser Ser Leu
405 410 415

* Ala Ser Ala Ala Thr Thr Gly Leu Cys Thr Pro Gly Ser Ser Ser
420 425 430

Gly Thr Thr Thr Ser Ala Ser Met Pro Met Thr Pro Ala Cys Phe Thr
435 440 445

Pro Pro Arg His Gly Ala Ser Arg Thr Leu * Arg Pro Gln Leu Leu
450 455 460

His Val Phe * Thr Val Ala Asn Asp Ile Thr Glu * Asn Phe Pro
465 470 475 480

Phe Gln Pro Ala Val Tyr Leu Pro Arg Ser Asp Leu Gln Met His Tyr
485 490 495

Val * Trp Asp * Arg Ala Pro Ala Thr Val Asp Val Thr Gly Phe
500 505 510

Phe Lys Arg Val Ser Leu * Thr Lys Ser * Gly Ser Leu Val Arg
515 520 525

Thr Arg Pro Val Lys Ala Lys
530 535

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GATTAAACAG CATACAGCTC CTGTGAGCCC ACATTCAACA TTTTTTGATA CTTTGATCCA 60 

TCTTTGGTTT CTACAGAAGA TGAAGAAGAT AGGCTTAGAG AGAGAAGGCG GCTTAGTATT 120 

GAAGAAGGGG TTGATCCCCC TCCCAATGCA CAAATACATA CATTTGAAGC TACTGCACAG 180 

GTTAATCCAT TATTAAACTG GGACCAAAAT TAGCTCCTGG AATGACTGAA ATAAGTGGGG 240

ACAGTTCTGC AATTCCACAA GCTAATTGTG ACTCGGAAGA GGATACAACC ACCCTGTGTT 300

GCAGTCACGG AGGCAGAAGC AGCGTCAGAT ATCTGGAGAC AGCCATACCC ATGTTAGCAG 360 

ACAGGGAGCT TGGAAAGTCC ACACACAGAT TGATTACATA CACTGCTTCG TGCCTGATTT 420 

GCTTeftftATT ACAGGGAATC CCTGTTACTG GGGAGTGATG GACCGTTATG AAGCAGAAGC 480 

CCTTCTCGAA GGGAAACCTG AAGGCACGTT TTTGCTCAGG GACTCTGCGC AAGAGGACTA 540 

CTTCTTCTCT GTGAGCTTCC GCCGATACAA CAGATCCCTG CATGCCCGAA TTGAGCAGTG 600

GAATCACAAC TTTAGTTTCG ACGCCCATGA CCCGTGTGTA TTTCACTCCT CCACTGTAAC 660

GGGACTTTTA GAACATTATA AAGATCCCAG TTCGTGCATG TTTTTTGAAC CATTGCTTAC 720 

TATATCACTA AATAGGACTT TCCCTTTTAG CCTGCAGTAT ATCTGTCGCG CGGTAATCTG 780 

CAGGTGCACT ACGTATGATG GAATTGATGG GCTCCCTCTA CCCTCAATGT TACAGGATTT 840 

TTTAAAAGAG TATCATTATA AACAAAAAGT TAGAGTTCGC TGGTTGGAAC GAGAACCAGT 900 

CAAGGCAAAG TAAACTCTCC GGTCCCCAAA GGGTGTTAAC TAGGTCCGCT TTCATGTGCA 960 



TCAGACAGTA CACCTATAGC AAGCACACGT AGCAGTGTTA GGCTTTTTCA TACAGTATGT 1020 

AAGCTTAGTG TTAGTATCTG TCAGATGCTA CCTGCTGTTA CTTATTCAGA TAAACATGGT 1080

GCCTATTGGA ACAATAGCGG ATAGAGCTAC AGGTGTTCAG TAAGACTACA AAAACATTTT 1140 

GCCTATTTCG CTAACAGTTT GGTTTTTAAT GGCTGTGGTA TTTGAGTGAG GCAACTCTGG 1200 

GGCATTTGTT ATGAAGAAAT G 1221 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2369 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 116..1330

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



GGCACGAGGC GGTGGTGGCG GCGGCGGGCG CGGCCGCGGC GGGGCGGGCG CGGAATGAAG 60 

GCCGACGGCC CTGGGGGCTG AGGCGCCCGC CGCCTGGGGC GGGCCGCGCG TCCTC ATG 118 

Met 
1 

GAG GCC GGA GAG GAG CCG CTG CTG CTG GCT GAA CTC AAG CCT GGG CGC 166 
Glu Ala Gly Glu Glu Pro Leu Leu Leu Ala Glu Leu Lys Pro Gly Arg 
5 10 15 

CCC CAC CAG TTC GAC TGG AAG TCA AGC TGC GAG ACC TGG AGC GTG GCC 214 
Pro His Gln Phe Asp Trp Lys Ser Ser Cys Glu Thr Trp Ser Val Ala
20 25 30

TTC TCG CCA GAC GGT TCC TGG TTC GCC TGG TCT CAA GGA CAC TGC GTG 262
Phe Ser Pro Asp Gly Ser Trp Phe Ala Trp Ser Gln Gly His Cys Val
35 40 45

GTC AAG CTG GTC CCC TGG CCC TTA GAG GAA CAG TTC ATC CCT AAA GGA
Val Lys Leu Val Pro Trp Pro Leu Glu Glu Gln Phe Ile Pro Lys Gly
50 55 60 65

TTC GAA GCC AAG AGC CGA AGC AGC AAG AAT GAC CCA AAA GGA CGG GGC
Phe Glu Ala Lys Ser Arg Ser Ser Lys Asn Asp Pro Lys Gly Arg Gly
70 75 80

AGT CTG AAG GAG AAG ACG CTG GAC TGT GGC CAG ATT GTG TGG GGG CTG
Ser Leu Lys Glu Lys Thr Leu Asp Cys Gly Gln Ile Val Trp Gly Leu
85 90 95

GCC TTC AGC CCG TGG CCC TCT CCA CCC AGC AGG AAA CTC TGG GCA CGT
Ala Phe Ser Pro Trp Pro Ser Pro Pro Ser Arg Lys Leu Trp Ala Arg
100 105 110

CAC CAT CCC CAG GCG CCT GAT GTT TCT TGC CTG ATC CTG GCC ACA GGT
His His Pro Gln Ala Pro Asp Val Ser Cys Leu Ile Leu Ala Thr Gly
115 120 125

CTC AAC GAT GGG CAG ATC AAG ATT TGG GAG GTA CAG ACA GGC CTC CTG
Leu Asn Asp Gly Gln Ile Lys Ile Trp Glu Val Gln Thr Gly Leu Leu
130 135 140 145

CTT CTG AAT CTT TCT GGC CAC CAA GAC GTC GTG AGA GAT CTG AGC TTC
Leu Leu Asn Leu Ser Gly His Gln Asp Val Val Arg Asp Leu Ser Phe
150 155 160

ACG CCC AGC GGC AGT TTG ATT TTG GTC TCT GCA TCC CGG GAT AAG ACA
Thr Pro Ser Gly Ser Leu Ile Leu Val Ser Ala Ser Arg Asp Lys Thr
165 170 175

CTT CGA ATT TGG GAC CTG AAT AAA CAC GGT AAG CAG ATC CAG GTG TTA
Leu Arg Ile Trp Asp Leu Asn Lys His Gly Lys Gln Ile Gln Val Leu
180 185 190

TCC GGC CAT CTG CAG TGG GTT TAC TGC TGC TCC ATC TCC CCT GAC TGT
Ser Gly His Leu Gln Trp Val Tyr Cys Cys Ser Ile Ser Pro Asp Cys
195 200 205

AGC ATG CTG TGC TCT GCA GCT GGG GAG AAG TCG GTC TTT CTG TGG AGC
Ser Met Leu Cys Ser Ala Ala Gly Glu Lys Ser Val Phe Leu Trp Ser
210 215 220 225

ATG CGG TCC TAC ACA CTA ATC CGG AAA CTA GAA GGC CAC CAA AGC AGT 838
Met Arg Ser Tyr Thr Leu Ile Arg Lys Leu Glu Gly His Gln Ser Ser
230 235 240

GTT GTC TCC TGT GAT TTC TCT CCT GAT TCA GCC TTG CTT GTC ACA GCT 886
Val Val Ser Cys Asp Phe Ser Pro Asp Ser Ala Leu Leu Val Thr Ala
245 250 255

TCG TAT GAC ACC AGT GTG ATT ATG TGG GAC CCC TAC ACC GGC GCG AGG 934
Ser Tyr Asp Thr Ser Val Ile Met Trp Asp Pro Tyr Thr Gly Ala Arg
260 265 270

CTG AGG TCA CTT CAT CAC ACA CAA CTT GAA CCC ACC ATG GAT GAC AGT 982
Leu Arg Ser Leu His His Thr Gln Leu Glu Pro Thr Met Asp Asp Ser
275 280 285

GAC GTC CAC ATG AGC TCC CTG AGG TCC GTG TGC TTC TCA CCT GAA GGC 1030
Asp Val His Met Ser Ser Leu Arg Ser Val Cys Phe Ser Pro Glu Gly
290 295 300 305

TTG TAT CTC GCT ACG GTG GCA GAT GAC AGG CTG CTC AGG ATC TGG GCT 1078
Leu Tyr Leu Ala Thr Val Ala Asp Asp Arg Leu Leu Arg Ile Trp Ala
310 315 320

CTG GAA CTG AAG GCT CCG GTT GCC TTT GCT CCG ATG ACC AAT GGT CTT 1126
Leu Glu Leu Lys Ala Pro Val Ala Phe Ala Pro Met Thr Asn Gly Leu
325 330 335

TGC TGC ACG TTC TTC CCA CAC GGT GGA ATT ATT GCC ACA GGG ACG AGA 1174
Cys Cys Thr Phe Phe Pro His Gly Gly Ile Ile Ala Thr Gly Thr Arg
340 345 350

GAT GGC CAT GTC CAG TTC TGG ACA GCT CCC CGG GTC CTG TCC TCA CTG 1222
Asp Gly His Val Gln Phe Trp Thr Ala Pro Arg Val Leu Ser Ser Leu
355 360 365

AAG CAC TTA TGC AGG AAA GCC CTC CGA AGT TTC CTG ACA ACG TAT CAA 1270
Lys His Leu Cys Arg Lys Ala Leu Arg Ser Phe Leu Thr Thr Tyr Gln
370 375 380 385

GTC CTA GCA CTG CCA ATC CCC AAG AAG ATG AAA GAG TTC CTC ACA TAC 1318
Val Leu Ala Leu Pro Ile Pro Lys Lys Met Lys Glu Phe Leu Thr Tyr
390 395 400

AGG ACT TTC TAGCAGTGCC GGCTCCCCCA CCTCCTGCAG CAGCAGCAGT 1367
Arg Thr Phe
405

ACAAGGGACT GGCTAGGATG GAGTCAGGCA GCTCACACTG GACCAGTGTG GACCTTCCTT 1427 

CCTCCCATGG CATGTGCAAG TAGGTCTGCG TGACCCCACT TCTGTGGTGC CGGCCTTACC 1487 

TCGTCTTCAT CCGTGGTGAG CAGCCTTCGT CAGTCTAGTT GTGTTGAAGC CAAGTGCAGT 1547 

TGTGGATGTT GCTGGGGTAA TAAAGGCAAG CGGGCTCCAG AGCCTCTCTG GTGGCGGCCA 1607

AGCCACACTC CCTTAACTGG GAAGTACCTG CCACGTAGGG CATTTCTGCT GCCTATTTCC 1667 

AGCCAGCGGC TGCATGGTTT GAAGTTCCTC CGTTGTGGTC AGAAGAACTC TGGTGTTTGG 1727 

TTCCCTGCTC AGCTGCGCGT GGACTGGGCT GAGCTCCTCA CCATACACTA GTGCCGGCTT 1787 

TTGTTTCCTG TAAACAGTGG TTGCATGTGT AGAGAAGTAA CAAGCGAGTA TTCAGATCAT 1847 

ACGAGGAGGC GTTCCTCGGT GCATGACGGT CAGATGGCCA TTTATCAGCA TATTTATTTG 1907 

TATTTTCTCA GCACATAGXA AGGTACAACT GTGTTTTCTC AATTGTCTCG AAAAAACAGA 1967 

GTTCTTAAGT GGCCCAGTTG TGGAGCCAAG TCTAAGTCGT GTGGAGTCAG TGCTGACATC 2027 

ACTGGCTTGT GCTGTCTGTC ACATGTGTTT GTCTCTGCTG CTTGACCTCA TGGGATGTAC 2087 

CCTCeftCTTC AACTGCCCAA AACAGACAGC CCCTTCCAAG CACCGTTCTT TGACAGCGGT 2147 

AGCAGCTACC TATTCAAGAC GCCTCACACA AAATCTGCCT TAGAAAGTTA ATATATTTTA 2207

AATTATTTTA AAAGAAACTC AACATCTTAT TCTTTGGCCT TTCTTAATTG ATGCTTTATG 2267 

GAGGCAGTGT TAACATTGTA CAGTGTATGC ATAGAGGAGT CTCCTCTATT TGAAGAACAA 2327 



TGCAAAATGA GGCTTTCATT GAAGGGAAAA AAAAAAAAAA AA 2369



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 404 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Glu Ala Gly Glu Glu Pro Leu Leu Leu Ala Glu Leu Lys Pro Gly
1 5 10 15

Arg Pro His Gln Phe Asp Trp Lys Ser Ser Cys Glu Thr Trp Ser Val
20 25 30

Ala Phe Ser Pro Asp Gly Ser Trp Phe Ala Trp Ser Gln Gly His Cys
35 40 45

Val Val Lys Leu Val Pro Trp Pro Leu Glu Glu Gln Phe Ile Pro Lys
50 55 60

Gly Phe Glu Ala Lys Ser Arg Ser Ser Lys Asn Asp Pro Lys Gly Arg
65 70 75 80

Gly Ser Leu Lys Glu Lys Thr Leu Asp Cys Gly Gln Ile Val Trp Gly
85 90 95

Leu Ala Phe Ser Pro Trp Pro Ser Pro Pro Ser Arg Lys Leu Trp Ala
100 105 110

Arg His His Pro Gln Ala Pro Asp Val Ser Cys Leu Ile Leu Ala Thr
115 120 125

Gly Leu Asn Asp Gly Gln Ile Lys Ile Trp Glu Val Gln Thr Gly Leu
130 135 140

Leu Leu Leu Asn Leu Ser Gly His Gln Asp Val Val Arg Asp Leu Ser
145 150 155 160

Phe Thr Pro Ser Gly Ser Leu Ile Leu Val Ser Ala Ser Arg Asp Lys
165 170 175

Thr Leu Arg Ile Trp Asp Leu Asn Lys His Gly Lys Gln Ile Gln Val
180 185 190

Leu Ser Gly His Leu Gln Trp Val Tyr Cys Cys Ser Ile Ser Pro Asp
195 200 205

Cys Ser Met Leu Cys Ser Ala Ala Gly Glu Lys Ser Val Phe Leu Trp
210 215 220

Ser Met Arg Ser Tyr Thr Leu Ile Arg Lys Leu Glu Gly His Gln Ser
225 230 235 240

Ser Val Val Ser Cys Asp Phe Ser Pro Asp Ser Ala Leu Leu Val Thr
245 250 255

Ala Ser Tyr Asp Thr Ser Val Ile Met Trp Asp Pro Tyr Thr Gly Ala
260 265 270

Arg Leu Arg Ser Leu His His Thr Gln Leu Glu Pro Thr Met Asp Asp
275 280 285

Ser Asp Val His Met Ser Ser Leu Arg Ser Val Cys Phe Ser Pro Glu
290 295 300

Gly Leu Tyr Leu Ala Thr Val Ala Asp Asp Arg Leu Leu Arg Ile Trp
305 310 315 320

Ala Leu Glu Leu Lys Ala Pro Val Ala Phe Ala Pro Met Thr Asn Gly
325 330 335

Leu Cys Cys Thr Phe Phe Pro His Gly Gly Ile Ile Ala Thr Gly Thr
340 345 350

Arg Asp Gly His Val Gln Phe Trp Thr Ala Pro Arg Val Leu Ser Ser
355 360 365

Leu Lys His Leu Cys Arg Lys Ala Leu Arg Ser Phe Leu Thr Thr Tyr
370 375 380

Gln Val Leu Ala Leu Pro Ile Pro Lys Lys Met Lys Glu Phe Leu Thr
385 390 395 400

Tyr Arg Thr Phe



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GACACTGCAT CGTCAAACTG ATCCCCTGGC CGTTGGAGGA GCAGTTCATC CCTAAAGGGT 60 

TTGAAGCCAA AAGCCGAAGT AGCAAAAATG AGACGAAAGG GCGGGGCAGC CCAAAAGAGA 120 

AGACGCTCGA CTGTGGTCAG ATTGTCTGGG GGCTGGCCTT CAGCCTGTGC TTTCCCCACC 180

CAGCAGGAAG CTCTGGGCAC GCCACCACCC CCAAGTGCCC GATGTCTCTT GCCTGGTTCT 240 

TGCTACGGGA CTCAACGATG GGCAGATCAA GATCTGGGAG GTGCAGACAG GGCTCCTGCT 300 

TTTGAATCTT TCCGGCCACC AAGATGTCGT GAGAGATCTG AGCTTCACAC CCAGTGGCAG 360

TTTGATTTTG GTCTCCGCGT CACGGGATAA GACTCTTCGC ATCTGGGACC TGAATAAACA 420 

CGGTAAACAG ATTCAAGTGT TATCGGGCCA CCTGCAGTGG GTTTACTGCT GTTCCATCTC 480

CCCAGACTGC AGCATGCTGT GCTCTGCAGC TGGAGAGAAG TCGGTCTTTC TATGGAGCAT 540

GAGGTCCTAC ACGTTAATTC GGAAGCTAGA GGGCCATCAA AGCAGTGTTG TCTCTTGTGA 600

CTTCTCCCCC GACTCTGCCC TGCTTGTCAC GGCTTCTTAC GATACCAATG TGATTATGTG 660 

GGACCCCTAC ACCGGCGAAA GGCTGAGGTC ACTCCACCAC ACCCAGGTTG ACCCCGCCAT 720

GGATGACAGT GACGTCCACA TTAGCTCACT GAGATCTGTG TGCTTCTCTC CAGAAGGCTT 780

GTACCTTGCC ACGGTGGCAG ATGACAGACT CCTCAGGATC TGGGCCCTGG AACTGAAAAC 840 

TCCCATTGCA TTTGCTCCTA TGACCAATGG GCTTTGCTGG CACATTTTTT CCACATGGTG 900 

GAGTCATTGC CACACGGACA AGAGATGGCC ACGTCCAGTT CTGGACAGCT CCTAGGGTCC 960 

TGTCCTCACT GAAGCACTTA TGCCGGAAAG CCCTTCGAAG TTTCCTAACA ACTTACCAAG 1020 

TCCTAGCACT GCCAATCCCC AAGAAAATGA AAGAGTTCCT CACATACAGG ACTTTTTAAG 1080 

CAACACCACA TCTTGTGCTT CTTTGTAGCA GGGTAAATCG TCCTGTCAAA GGGAGTTGCT 1140 

GGAATAATGG GCCAAACATC TGGTCTTGCA TTGAAATAGC ATTTCTTTGG GATTGTGAAT 1200

AGAATGTAGC AAAACCAGAT TCCAGTGTAC TAGTCATGGA TTTTTC 1246

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ACCATGGTTC CAAGTCCTCT CCCCTGTGGT CAAGTTGCCC GAATGTTGGG CCCAAGTGCC 60 

TTTTCCTCCT TGGGCCTCCC CTTCTGACCT GCAGGACAGT TTTCCGGAGC CCATTTGGTA 120 

TGAGGTATTA ATTAGCCTTA ACTAAATTAC AGGGGACTCA GAGGCCGTGC TCCTGACCGA 180 

TCCAGACACT ATTTTTTTTT TTTTTTTTTA ACAATGGTGT GCATGTGCAG GAAATGACAA 240

ATTTGTATGT CAGATTATAC AAGGATGTAT TCTTAAACCG CATGACTATT CAGATGGCTA 300 

CTGAGTTATC AGTGGCCATT TATTAGCATC ATATTTATTT GTATTTTCTC AACAGATGTT 360

AAGGTACAAC TGTGTTTTTC TCGATTATCT AAAAACCATA GTACTTAAAT TGAAAAAAAA 420

AA 422 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2019 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

GGCACGAGGC GGGGTCAGGG CGGAGGCTGA GGACCAAGTA GGCATGGCGG AGGGCGGGAC 60 

CGGCCCCGAT GGACGGGCCG GCCCGGGACC CGCAGGTCCT AATCTGAAGG AGTGGCTGAG 120 

GGAGCAGTTC TGTGACCATC CACTGGAGCA CTGTGACGAT ACAAGACTCC ATGATGCAGC 180

CTATGTAGGG GACCTCCAGA CCCTCAGGAA CCTACTGCAA GAGGAGAGCT ACCGGAGCCG 240

CATCAATGAG AAGTCTGTCT GGTGCTGCGG CTGGCTTCCC TGCACACCAC TCAGGATCGC 300

AGCCACTGCA GGCCATGGGA ACTGTGTGGA CTTCCTCATA CGCAAAGGGG CCGAGGTGGA 360

CCTGGTGGAT GTCAAGGGGC AGACTGCCCT GTATGTGGCT GTAGTGAACG GGCACTTGGA 420 

GAGCACTGAG ATCCTTTTGG AAGCTGGTGC TGATCCCAAC GGCAGCCGGC ACCACCGCAG 480 

CACTCCTGTG TACCATGCCT YTCGTGTGGG TAGGGACGAC ATCCTGAAGG CTCTTATCAG 540 

GTATGGGGCA GATGTTGATG TCAACCATCA TCTGAATTCT GACACCCGGC CCCCTTTTTC 600 

ACGGCGGCTA ACCTCCTTGG TGGTCTGTCC TCTATACATC AGTGCTGCCT ACCATAACCT 660 

TCAGTGCTTC AGGCTGCTCT TGCAGGCTGG GGCAAATCCT GACTTCAATT GCAATGGCCC 720 

TGTCKKCACC CAGGAGTTCT ACAGGGGATC CCCTGGGTGT GTCATGGATG CTGTCCTGCG 780 

CCATGGCTGT GAAGCAGCCT TCGTGAGTCT GTTGGTAGAG TTTGGAGCCA ACCTGAACCT 840 

GGTGAAGTGG GAATCCCTGG GCCCAGAGGC AAGAGGCAGA AGAAAGATGG ATCCTGAGGC 900 

CTTGCAGGTC TTTAAAGAGG CCAGAAGTAT TCCCAGGACC TTGCTGAGTT TGTGCCGGGT 960



GGCTGTGAGA AGAGCTCTTG GCAAATACCG ACTGCATCTG GTTCCCTCGC TGCCGCTGCC 1020 

AGACCCCATA AAGAAGTTTT TGCTTTATGA GTAGCATTCA CATGCAGTGC TGACTGCAAT 1080 

GTGGAAGCCG ATCACCTGCA GTGAAAACTG ACACAGACTC TGGCATCCTG GGAACCATGG 1140 

CCTGTGCTGC CAGCTTGATC CTTGGCTGTC AGTGAAGAAA AAACGGCTGT GTTCTCTTGG 1200 

ACTGTGATTC TATCTCAGGT GCTTGGGCCA TCGAACGCTC CTTGAGTCAT TGTCAACTGA 1260

GAGGCACATA CAAACTTAAT TTTGTTCCTC TTCAGTCTCT CTGTTTTGGA TTCTTCCTGG 1320

CAATGTGTGC AGCATGGGCT GAGCCTGGTG ATTGCCCTAG TGGGGAAGGC TTTTTTCTCC 1380

AGGCTATGCA TCTATTTATG TTCCTACTTT GCAATTTATT GTTCTTTTAA GGCTTGATAT

CAAAACAGAA AGAGGTTTGT TAAGAAAAGA TATAGGGAGA AAGGAATTCC GGTTCCGTGC

ACTTGCTAGC CTGCTTTCCT TGCCTGGGTT TGTCTGTCTA TGCTGCCTGG TGCACATCCC

TTCTCTTTGC TGCCACTGTT CTATTTTGGG AGTTGTCTTC CGTCTAAGAT GGCTTCTGGG

GTTCTATCTT ATTGCACAGA GGTCCCAGAA CAGTGTTCAT AGGGCACCAT CTGCTCTGCC

AAGGGTTTTC TGATGTCTTA CCCTGGGGAT CTTCAGACAG TGGTTACCTT TAGGAGACCC

ACCTGGAACT AACCATTAAG TGACTGCCCA CATTCAGATC AGGGACCATC TTAATAGTAC

TCACTGCCAG TCCTCACAAG AGAAGATGAC ACGGGTGCTC TCTTCAGACA CTCCCATACA

GGAAGTTGGA AAATGTCTTG GTCACCTGGG TTGTTCCCAG GCTACAACTT CTTGGTGTTC

CACTAARACC AGRATATCCT AGTTTTTTGG GTTGACTGTT CCCTCCCCAC TTTCCTTGAA

NCCCAATGCC CNTTTGTKTN GGTTGCTTCC CTAAAAKTT

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 350 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: SNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Ala Arg Gly Gly Val Arg Ala Glu Ala Glu Asp Gln Val Gly Met Ala
1 5 10 15

Glu Gly Gly Thr Gly Pro Asp Gly Arg Ala Gly Pro Gly Pro Ala Gly
20 25 30

Pro Asn Leu Lys Glu Trp Leu Arg Glu Gln Phe Cys Asp His Pro Leu
35 40 45

Glu His Cys Asp Asp Thr Arg Leu His Asp Ala Ala Tyr Val Gly Asp
50 55 60

Leu Gln Thr Leu Arg Asn Leu Leu Gln Glu Glu Ser Tyr Arg Ser Arg
65 70 75 80

Ile Asn Glu Lys Ser Val Trp Cys Cys Gly Trp Leu Pro Cys Thr Pro
85 90 95

Leu Arg Ile Ala Ala Thr Ala Gly His Gly Asn Cys Val Asp Phe Leu
100 105 110

Ile Arg Lys Gly Ala Glu Val Asp Leu Val Asp Val Lys Gly Gln Thr
115 120 125

Ala Leu Tyr Val Ala Val Val Asn Gly His Leu Glu Ser Thr Glu Ile
130 135 140

Leu Leu Glu Ala Gly Ala Asp Pro Asn Gly Ser Arg His His Arg Ser
145 150 155 160

Thr Pro Val Tyr His Ala Xaa Arg Val Gly Arg Asp Asp Ile Leu Lys
165 170 175

Ala Leu Ile Arg Tyr Gly Ala Asp Val Asp Val Asn His His Leu Asn
180 185 190

Ser Asp Thr Arg Pro Pro Phe Ser Arg Arg Leu Thr Ser Leu Val Val
195 200 205

Cys Pro Leu Tyr Ile Ser Ala Ala Tyr His Asn Leu Gln Cys Phe Arg
210 215 220

Leu Leu Leu Gln Ala Gly Ala Asn Pro Asp Phe Asn Cys Asn Gly Pro
225 230 235 240

Val Asn Thr Gln Glu Phe Tyr Arg Gly Ser Pro Gly Cys Val Met Asp
245 250 255

Ala Val Leu Arg His Gly Cys Glu Ala Ala Phe Val Ser Leu Leu Val
260 265 270

Glu Phe Gly Ala Asn Leu Asn Leu Val Lys Trp Glu Ser Leu Gly Pro
275 280 285

Glu Ala Arg Gly Arg Arg Lys Met Asp Pro Glu Ala Leu Gln Val Phe
290 295 300

Lys Glu Ala Arg Ser Ile Pro Arg Thr Leu Leu Ser Leu Cys Arg Val
305 310 315 320

Ala Val Arg Arg Ala Leu Gly Lys Tyr Arg Leu His Leu Val Pro Ser
325 330 335

Leu Pro Leu Pro Asp Pro Ile Lys Lys Phe Leu Leu Tyr Glu
340 345 350



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 419 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

GCATCCATGG CGGAGGGCGG CAGCACGACG GGCGGGCAGG GCCGGGCTCC GCAGGTCGTA 60 

ATCTGAAGGA GTGGCTGAGG GAGCAATTTT GTGATCATCC GCTGGAGCAC TGTGAGGACA 120

CGAGGCTCCA TGATGCAGCT TACGTCGGGG ACCTCCAGAC CCTCAGGAGC CTATTGCAAG 180

AGGAGAGCTA CCGGAGCCGC ATCAACGAGA AGTCTGTCTG GTGCTGTGGC TGGCTCCCCT 240 

GCACACCGTT GCGAATCGCG GCCACTGCAG GCCATGGGAG CTGTGTGGAC TTCCTCATCC 300 

GGAAGGGGGC CGAGGTGGAT CTGGTGGACG TAAAAGCACA GACGGCCCTG TATGTGGCTG 360

TGGTGAACGG GCACCTAGAG AGTACCCAGA TCCTTCTCGA AGCTGGCGCG GACCCCAAC 419

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 595 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

GAGGAAGAAG AAAAGTGGAC CCTGAGGCCT TGCAGGTCTT TAAAGAGGCC AGAAGTGTTC 60

CCAGAACCTT GCTGTGTCTG TGCCGTGTGG CTGTGAGAAG AGCTCTTGGC AAAACCGGCT 120

TCATCTGATT CCTTCGCTGC CTCTGCCAGA CCCCATAAAG AAGTTTCTAC TCCATGAGTA 180

GACTCCAAGT GCTGCGGTTG ATTCCAGTGA GGGAGAAAGT GATCTGCAGG GAGGTGGACA 240

CCGAGCCCTG AGTGCTGTGC TGCTGCTGGT CTCCTGATGG CTGTTGCTGC AGAAGATGTC 300

CTCGTAGACT GTCAXTGCTC CTCAGGTGCC TGGGCCGCTG AACAGTCCTT GGGTCATTGT 360

CAGCTGAGAG GCTTATACTA AAGTTATTAT TGTTTTTCCC AAGTTCTCTG TTCTGGATTT 420

TCAGTTGCAT ATTAATGTAA CGGGCCATGG GGTATGTACA TGTAGGGGCT GAGGTTGGAG 480 

GCCTACTAAT TTCCTGTAGG GAAGACTCCC AGCACTTCTG GAACTGTGCT TCTCTTTATT 540 

TTTCTACTTC TCAATTTGAT GGTTCGATTA AAGCCTTCTA GTATCTCAAT GAAAA 595 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..396

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

CTG ATG TCC GCA ATT CTG AAG GTT GGA CAC CAC TGC TGG CTG CCT GTG
Met Ser Ala Ile Leu Lys Val Gly His His Cys Trp Leu Pro Val
1 5 10 15

ACA TCC GCT GTC AAT CCC CAA AGG ATG CTG AGG CCA CCA CCA ACC GCT
Thr Ser Ala Val Asn Pro Gln Arg Met Leu Arg Pro Pro Pro Thr Ala
20 25 30

GTT TTC AAC TGT GCC GCT TGC TGC TGT CTG TGG GGG CAG ATG CTG ATG
Val Phe Asn Cys Ala Ala Cys Cys Cys Leu Trp Gly Gln Met Leu Met
35 40 45

AAT ACA TAC CGT GTA GTT CAG CTT CCT GAG GAG GCC AAG GGC TTG GTG
Asn Thr Tyr Arg Val Val Gln Leu Pro Glu Glu Ala Lys Gly Leu Val
50 55 60

CCA CCA GAG ATT CTA CAG AAG TAC CAT GGA TTC TAC TCT TCC CTC TTT
Pro Pro Glu Ile Leu Gln Lys Tyr His Gly Phe Tyr Ser Ser Leu Phe
65 70 75

GCC TTG GTG AGG CAG CCC AGG TCG CTG CAG CAT CTC TGC CGT TGT GCG
Ala Leu Val Arg Gln Pro Arg Ser Leu Gln His Leu Cys Arg Cys Ala
80 85 90 95

CTC CGC AGT CAC CTG GAG GGC TGT CTG CCC CAT GCA CTA CCG CGC CTT
Leu Arg Ser His Leu Glu Gly Cys Leu Pro His Ala Leu Pro Arg Leu
100 105 110

CCC CTG CCA CCG CGC ATG CTC CGC TTT CTG CAG CTG GAC TTT GAG GAT
Pro Leu Pro Pro Arg Met Leu Arg Phe Leu Gln Leu Asp Phe Glu Asp
115 120 125

CTG CTC TAC TAGGCTTGCT GCCCTGTGAA CAAAGCAGAC CCCACCCCCA
Leu Leu Tyr
130

CCCCAAGGGC ATCTCTCAGC AATGAATGAT GCAAGGCGGT CTGTCTTCAA GTCAGGAGTG 493

GACGCCTTGA TCCACACTTG AGAGAAGAGG CCAGATCAGC ACCYGGCTGG TAGTGATNGC 553 

AGAGGGCACC TGTGCAGATC TGTGTGCGCA CTGGAAATCT CTAGGCTGAA GGCYAGAGCA 613 

AATGGTGCAR GTGTTAGTCC TTGGGANGAG AGACAGANGG TGAGAAAGCA AGACAGAGGT 673 

GAGAGTGCAC ATGTCAAGTG GTAGATTGCC TTAAAAGAAA GCTAAAAAAA GAAAAAGATT 733

CGGGCGAACT TCTTTAGGGG TAATGCTGCA GCGTGTTAAA CTGACTGACC AGCGTCCATA 793 

TCTTTGGACC CTTCCCGGGT GAAAAAGCCC CTTCATCCTC CAGCGCTCCC CAAGGGTGCT 853 

TAGCAATACC GGGTGCTTTT CTGCCGCAAA GTGAGTTACC AAA 896

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 130 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Ser Ala Ile Leu Lys Val Gly His His Cys Trp Leu Pro Val Thr
1 5 10 15

Ser Ala Val Asn Pro Gln Arg Met Leu Arg Pro Pro Pro Thr Ala Val
20 25 30

Phe Asn Cys Ala Ala Cys Cys Cys Leu Trp Gly Gln Met Leu Met Asn
35 40 45

Thr Tyr Arg Val Val Gln Leu Pro Glu Glu Ala Lys Gly Leu Val Pro
50 55 60

Pro Glu Ile Leu Gln Lys Tyr His Gly Phe Tyr Ser Ser Leu Phe Ala
65 70 75 80

Leu Val Arg Gln Pro Arg Ser Leu Gln His Leu Cys Arg Cys Ala Leu
85 90 95

Arg Ser His Leu Glu Gly Cys Leu Pro His Ala Leu Pro Arg Leu Pro
100 105 110

Leu Pro Pro Arg Met Leu Arg Phe Leu Gln Leu Asp Phe Glu Asp Leu
115 120 125

Leu Tyr
130

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

GTGGGGGCGT CATCATGACC TCCTCTAGGG CTCTGCAACA TGACTCCTGT GGTGCAAATC 60 

AACAAATTGT TCACTGATGA ATCCACAAGG ATCTCTGGGC CTACAACCAG GTCCTGGTCC 120 

ACATGACTGT CGTCTTCGGA GAAGGCACCA CTCGCCCCCG GCAGGTACGG CTGACACCTC 180 

CATGGGAGAA GACGTATCCA GGCAGCAGCT GCGCGGCCCT TCAAGAGGGC ACATCCCGTC 240 

ATCTAAAGGC ACGGTGTACT GAAGGTAGTC CTGAGACATG AGTCCGATTA CTACAGGCAC 300 

GTGTTCCTCC AGGTGGAGGC TCAGGTCCCC GGGTGAGCTG GGGCTGCAGC GGGACTCAGG 360

GCGCGGCTCT GGCTGCAGGT CTCGCAGCTC CCTGGGCTGT AGCTCCCGCA GATCCTTGCG 420

CACACCGTTG ACTGGT 436

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2180 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

TTAATAGTAC CTACATAGTA GAAAATTATA ACTCCACTTT AAAACAATGT TTTCTTTCTA 60 

TTCAAATCAA TTTAAAACTT TTTATAAACA TTAATGTTGC AAGAGAATCC AGTCCATTTA 120 

TGAAAATTAG TTGACAATCA AGTTCACCCA AGAAAATGTT GACTAAGCTA AAGAAATCAC 180

AGATAAAACA TTTTACCAAA AGGATAGGTA ACACACAAAA AAATGCTATC ACAGGAAGCT 240

ATGATCATCT AATATTTCTT TAATAATAAT TCTAGTTCCA TAGGTTTTCA TGTTATGCCA 300

ATTTGTACCC GAGTTTAATT ACAGAAAAGG CAACAATTTC TAAATTGGTG GTATACATTT 360

CTTTACAATT TTTTAATGTA AGGCCATTTA TTAAAATAGA CAAACTAGAA GATGAAAACG 420

AAGGCAACAG AAAAATTCAA CTTTTCACAA CCAAAAGAAT TAGCACAACC TTAGAAATAA 480

TTTAGAAAAA AGTGTTGTTA AAAGATATGT TGCAGATCTC CGTTCCATTA CCCAAGATTA 540

TGTCAATTCA CGATTCTAAA TAAATCTTTT TAAAGTAAGA GATTAAAAAC TCATCTTCAG 600 

TGTATATGTA AATTCCGTGG TTTTATCACA CAGGTATGTT TATTCAACAC TGCTTTGGAA 660 

ATGGACCATT TAAAAGGACA TGGCAATTTC CATTCTGTTA AGTTTCATTC AACCTTTACT 720 

TAGGGGTTGA TTACCACATG AAATGTGCTT TTAATGCATA AAAATCACAG TGGATTAGCC 780

AGCAAAAGGG ACTGGGCGGG GGGGGCATTG AGGAGAATTT GATAATTCAC ATTGTGATTA 840 

TTCTGCACAT TGATGAAACA TAATTCACAC CTCTAAAACC TCAAGACTTC CCTTTTTTAA 900 

AGAACCAAAA TAAACCCAAG ACACCTTGCT GACACTTCCC CACCCCTAAA CAAACTGATG 960 

ACTCTTTTAC ACATAAAACT GAAATAGTTA TGGCAGCAAA AGATTTTGAT GGCAATGAAA 1020

GTTTGTAAAC TGTATTTCAA TCTCTTGTTC TTATTCCCAA AGTGCAAGAT GCAGGGTTCT 1080

CAATCTTTCA GTAGTGCTTC TCCTGTAAAT AATCCTTCAT TTTGTTTGGC AAAGGCAGTT 1140

TCTGAATTAA GTCTATTCTG GTATACTGAC GTATAACAAA ACGACACAGG TACTGCAACG 1200 

AGCGCACCTA TGAACCCCGG AACACTGGTT GGCAAGTTCT GACGGAAGTG CAGATTCCAG 1260

GCAGCGAGAC CTTGAATAAC AAAAAGCTCC CATTTTCAGA GTCCCTGATT GAATGCTCCA 1320

ATTAGATCAA CTATGGACGT ATGTCCTTCC ACATCGGCTG TTCATAAAAG CTAAACCTAC 1380 

CATTTGAGTG CTCAATTCTA GTGTGAAGTG TTTTACCATG GGAGCGAAAG TCACAGCTTA 1440 

AAAGGTAACG GTCGTCAGAA CTGTCCCGAA CAAGAAAAGA ACCATCTGGC ACGTTTGCTA 1500

GCTTCCCTTC TGCCTCCCAA CGTGTGATTG GTCCCCAGTA CCATCCTTGC TTTGCAAGTT 1560

TTTTCAGCTC CTCTGTAAGG CTTGTCACAA CCATGGGACC ACTACTTTGC ACTGAGTCAT 1620

AAACTCTTGC AACCCCAGGA GCAGAGTTCG GATCAAAATT CAAATGACAG CGCATAACTT 1680 

TCAGCCACGT GGGGCTTTCT GTCCAGTGAG TCCACTGAAA GTTCCCCTTT GGGATTTGGA 1740 

TTATTCCTGC ATTGGAGTAA CCAATGGTGA AGATTGGAGG GACATCCATC GTGAACCCGC 1800 

TCTCCGGGGT TCTGCAACAT GACTCCCGTG GTGCCAATCA ACAAGCCATT CACCGGACTG 1860 

ATCCA€GAAG ATCTCTGGGG CGACAACTAG GTCCTGGTCT ACCTGACTCT CATCCTCGGG 1920 

GAAAGCGCGC CCTCCCACTT GAGGAGGAAC CGCAGAGACT TCCATGGGAG AAGAGCTGTC 1980 

CAGACAATAG CTCCGTGATC CTTCCAAAGG ATACATCCCC TCATCTAAAG GCACAGTATA 2040

CTGAATGTAG TCCTGAGGCA TAAGTCCAAT AACGACAGGC ACATGTTCAT CCAGGTGAAG 2100 

ATGCAGGTCT CCATTATGAG AAGCCGAGCT CTTCAGTGAA TTGGCTTGCT CCTGGCACGT 2160 

GGTCTCAGAC TGGAGGTCGT 2180 



(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2649 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

GGCACGAGGC TGTGTCCAGC ACACAGAGAG GGCCCGGCCA TCTGCTTTGG TTCAGAGCCC 60 

TGTGTCTGTC TGTCACTTAG ACTCTTCCTC CCGGCTCGCA GCTCACCCTC CATCCTCCTT 120 

ACTGGCTCCA GCATGACTCG CTTCTCTTAT GCAGAGTACT TTGCTCTGTT TCACTCTGGC 180 

TCTGCACCTT CCAGGTCCCC TTCGTCTCCC GAGAACCCAC CGGCCCGCGC ACCCCTGGGT 240 

CTGTTCCAAG GGGTCATGCA GAAGTATAGC AGCAACCTGT TCAAGACCTC CCAGATGGCG 300 

GCTATGGACC CCGTGCTGAA GGCCATCAAG GAAGGGGATG AAGAGGCCTT GAAGATCATG 360 

ATCCAGGATG GGAAGAATCT TGCAGAGCCC AACAAGGAGG GCTGGCTGCC GCTCCACGAG 420 

GCTGCCTACT ATGGCCAGCT GGGCTGCCTG AAAGTCCTGC AGCAAGCCTA CCCAGGGACC 480 

ATTGACCAAC GCACACTGCA GGAAGAGACA GCATTATACC TGGCCACATG CAGAGAACAC 540 

CTGGATTGCC TCCTGTCGCT GCTCCAGGCG GGGGCAGAGC CTGACATCTC TAACAAATCC 600 

AGGGAGACTC CACTTTACAA AGCCTGTGAG CGCAAGAACG CGGAGGCGGT GAGGATATTG 660 

GTGCGATACA ACGCAGACGC CAACCACCGC TGTAACAGGG GCTGGACCGC ACTGCACGAG 720 

TCTGTCTCCC GCAATGACCT GGAGGTCATG GAGATCCTAG TGAGTGGCGG GGCCAAGGTG 780 

GAGGCCAAGA ATGTCTACAG CATCACCCCT TTGTTTGTGG CTGCCCAGAG TGGGCAGCTG 840 

GAGGCCCTGA GGTTCCTGGC CAAGCATGGT GCAGACATCA ACACGCAGGC CAGTGACAGT 900 

GCATCAGCCC TCTACGAGGC CAGCAAGAAT GAGCATGAAG ACGTGGTAGA GTTTCTTCTC 960 

TCTCAGGGCG CCGATGCTAA CAAAGCCAAC AAGGACGGCC TGCTCCCCCT GCATGTTGCC 1020

TCCAAGAAGG GCAACTATAG AATAGTGCAG ATGCTGCTGC CTGTGACCAG CCGCACGCGC 1080

GTGCGCCGTA GCGGCATCAG CCCGCTGCAC CTAGCGGCCG AGCGCAACCA CGACGCGGTG 1140 

CTGGAGGCGC TGCTGGCCGC GCGCTTCGAC GTGAACGCAC CTCTCGCTCC CGAGCGCGCC 1200 

CGCCTCTACG AGGACCGCCG CAGTTCTGCG CTCTACTTCG CTGTGGTCAA CAACAATGTG 1260 

TACGCCACCG AGCTGTTGCT GCTGGCGGGC GCGGACCCCA ACCGCGATGT CATCAGCCCT 1320

CTGCTCGTGG CCATCCGCCA CGGCTGCCTG CGCACCATGC AGCTGCTCTT GGACCATGGC 1380 

GCCAACATCG ACGCCTACAT CGCCACTCAC CCCACCGCCT XTCCAGCCAC CATCATGTTT 1440 

GCCATGAAGT GCCTGTCGTT ACTCAAGTTC CTTATGGACC TCGGCTGCGA TGGCGAGCCC 1500 

TGCTTCTCCT GCCTGTACGG CAACGGGCCG CACCACCCGC CCCGCGACCT GGCCGCTTCC 1560 

ACGACGCACC CGTGGACGAC AAGGCACCTA GCGTGGTGCA GTTCTGTGAG TTCCTGTCGG 1620 

CCCCGGAAGT GAGCCGCTGG GCGGGACCCA TCATCGATGT CCTCCTGGAC TATGTGGGCA 1680 

ACGTGCAGCT GTGCTCCCGG CTGAAGGAGC ACATCGACAG CTTTGAGGAC TGGGCTGTCA 1740 

TCAAGGAGAA GGCAGAACCT CCGAGACCTC TGGCTCACCT CTGCCGGCTG CGGCTTCGGA 1800 

AGGCCATAGG AAAATACCGG ATAAAACTCC TGGACACACT GCCGCTTCCC GGCAGGCTAA 1860

TCAGMftCTT GAAATATGAG AATACACAGT AACCAGCCTG GAGAGGAGAT GTGGCCTTCA 1920 

GACTGTTTCC GGGACGCCCC AGGTGGCCTG CATCCAGGAC CCCCTGGGGT CAGAACAGGT 1980 

GTGACCTTGC TGGTTCTTTG CTGGAGCTTC ACCCAAAGTG AGAACCTGAT GTGGGGAGTG 2040 

GACGTGGAAC CTCTGCTTTC ACACTGTCAG CGGATCGCAG ACCCGCTCTG CTTCTGGCCA 2100 

TAGCCAGAGA CCTTCAACCT GGGGCCAGGG GAGAGCTGGT CTGGGCAAGG TGGCCCAGGC 2160 

AGGAATCCTG GCCTTAAGCT GGAGAACTTG TAGGAATCCC TCACTGGACC CTCAGCTTTC 2220

AGGCTGCGAG GGAGACGCCC AGCCCAAGTA TTTTATTTCC GTGACACAAT AACGTTGTAT 2280 

CAGAAAAAAA AAAAAACATG GGCGCAGCTT ATTCCTTAGT AGGGTATTTA CTTGCATGCG 2340 

CGCTTAAAGC TACTGGAAAC ATGCGTTCCA CTATGCTTGA GAATCCCCTT GCACTGGTAA 2400

ACGAGAGCCG ACGTGCTTCA AGGTTGGATT TTTGGTTGCC CCTTTGGCGT TCCGCGGGTT
TGTCCGACGT AATTGACCCC GTGTTTTGTC ACTTTCGAGT GTTCCGACTA TTGGGGGGCT 
TTTGGTTGTC CCCAAAATTG TGGGTGGTGT GCGGACGCCA CGAGAAGTGG TTCATGGGCG 
ATAATCATTA CTGGAGAATG TAGAGCGGCG GTTTTACGAA TAAATATTTT TTAAGCCGCC 
TTCCCAAAA 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CCTCCTGAGA GTTCGCCGGC CCGGGCCCAA TGGGTTGTTC CAAGGGGTCA TGCAGAAATA 
CAGCAGCAGC TTGTTCAAGA CCTCCCAGCT GGCGCCTGCG GACCCCTTGA TAAAGGCCAT 
CAAGGKTGCG ATGAAGAGGC CTTGAAGACC ATGATCAAGG AAGGGAAGAA TCTCGCAGAG 
CCCAACAAGG AGGGCTGGCT GCCGCTGCAC GAGGCCGCAT ACTATGGCCA GGTGGGCTGC 
CTGAAAGTCC TGCAGCGAGC GTACCCAGGG ACCATCGACC AGCGCACCCT GCAGGAGGAA 
ACAGCCGTTT ACTTGGCAAC GTGCKGGGGC CACCTGGACT GTCTCCTGTC ACTGCTCCAA 
GCAGGGGCAG AGCGGGACAT CTCCAACAAA TCCCGAGAGA ACCGCTCTAC AAAGCCTGTG 
AGCGCAAGAA CGCGGAAGCC GTGAAGATTC TTGGTGCAGC ACAACGCAGA CACCAACAAC 
GCTGCAACCG GGCTG 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 709 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GTGCAGCTCT GCTCGCGGCT GAAGGAACAC ATCGACAGCT TTGAGGACTG GGCCGTCATC 
AAGGAGAAGG CAGAACCTCC AAGACCTCTG GCTCACCTTT GCCGACTGCG GGTTCGAAAG 
GCCATTGGGA AATACCGTAT AAAACTCCTA GACACCTTGC CGCTCCCAGG CAGGCTGATT 
AGATACCTGA AATACGAGAA CACCCAGTAA CTGGGGCCAC GGGGAGAGAG GAGTAGCCCC 
TCAGACTCTT CTTACTAAGT CTCAGGACGT CGGTGTTCCC AACTCCAAGG GGACCTGGTG 
ACAGACGAGG CTGCAGGCTG CCTCCCTCTC AGCCTGGACA GCTACCAGGA TCTCACTGGG 
TCTCAGGGCC CAGAGCTTTG GCCAGAGCAG AGAACAGAAT GTGTCAAGGA GAAGAATCAT 
TTGTTTACAA ACTGATGAGC AGATCCCAGA CCTTCTCTAC CTTCAGGAAT GGCAGAAACC 
TCTATTCCTG GGGCCAGGGC AGAGCTTGAG GTGTTCTGGG GAAGGTGGTG CTCAGAGCCT 
TCCCTGTGCC CCTCCACTTG TTCTGGAAAA CTCACCACTT GACTTCAGAG CTTTCTCTCC 
AAAGACTAAG ATGAAGACGT GGCCCAAGGT AGGGGGTAGG GGGAGCCTGG GTCTTGGAGG 
GCTTTGTTAA GTATTAATAT AATAAATGTT ACACATGTGA AAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..624



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TTG GAG AAG TGT GGT TGG TAT TGG GGG CCA ATG AAT TGG GAA GAT GCA
Leu Glu Lys Cys Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala
1 5 10 15

GAG ATG AAG CTG AAA GGG AAA CCA GAT GGT TCT TTC CTG GTA CGA GAC
Glu Met Lys Leu Lys Gly Lys Pro Asp Gly Ser Phe Leu Val Arg Asp
20 25 30

AGT TCT GAT CCT CGT TAC ATC CTG AGC CTC AGT TTC CGA TCA CAG GGT
Ser Ser Asp Pro Arg Tyr Ile Leu Ser Leu Ser Phe Arg Ser Gln Gly
35 40 45

ATC ACC CAC CAC ACT AGA ATG GAG CAC TAC AGA GGA ACC TTC AGC CTG
Ile Thr His His Thr Arg Met Glu His Tyr Arg Gly Thr Phe Ser Leu
50 55 60

TGG TGT CAT CCC AAG TTT GAG GAC CGC TGT CAA TCT GTT GTA GAG TTT
Trp Cys His Pro Lys Phe Glu Asp Arg Cys Gln Ser Val Val Glu Phe
65 70 75 80

ATT AAG AGA GCC ATT ATG CAC TCC AAG AAT GGA AAG TTT CTC TAT TTC
Ile Lys Arg Ala Ile Met His Ser Lys Asn Gly Lys Phe Leu Tyr Phe
85 90 95

TTA AGA TCC AGG GTT CCA GGA CTG CCA CCA ACT CCT GTC CAG CTG CTC
Leu Arg Ser Arg Val Pro Gly Leu Pro Pro Thr Pro Val Gln Leu Leu
100 105 110

TAT CCA GTG TCC CGA TTC AGC AAT GTC AAA TCC CTC CAG CAC CTT TGC
Tyr Pro Val Ser Arg Phe Ser Asn Val Lys Ser Leu Gln His Leu Cys
115 120 125

AGA TTC CGG ATA CGA CAG CTC GTC AGG ATA GAT CAC ATC CCA GAT CTC
Arg Phe Arg Ile Arg Gln Leu Val Arg Ile Asp His Ile Pro Asp Leu
130 135 140

CCA CTG CCT AAA CCT CTG ATC TCT TAT ATC CGA AAG TTC TAG TAC TAT 480
Pro Leu Pro Lys Pro Leu Ile Ser Tyr Ile Arg Lys Phe Tyr Tyr Tyr
145 150 155 160

GAT CCT CAG GAA GAG GTA TAC CTG TCT CTA AAG GAA GCG CAG CGT CAG 528
Asp Pro Gln Glu Glu Val Tyr Leu Ser Leu Lys Glu Ala Gln Arg Gln
165 170 175

TTT CCA AAC AGA AGC AAG AGG TGG AAC CCT CCA CGT AGC GAG GGG CTC 576
Phe Pro Asn Arg Ser Lys Arg Trp Asn Pro Pro Arg Ser Glu Gly Leu
180 185 190

CCT GCT GGT CAC CAC CAA GGG CAT TTG GTT GCC AAG CTC CAG CTT TGAAGAACCA 631
Pro Ala Gly His His Gln Gly His Leu Val Ala Lys Leu Gln Leu
195 200 205

AATTAAGCTA CCATGAAAAG AAGAGGAAAA GTGAGGGAAC AGGAAGGTTG GGATTCTCTG 691 

TGCAGAGACT TTGGTTCCCC ACGCAAGCCC TGGGGCTTGG AAGAAGCACA TGACCGTACT 751 

CTGCGTGGGG CTCCACCTCA CACCCACCCC TGGGCATCTT AGGACTGGAG GGGCTCCTTG 811 

GAAAACTGGA AGAAGTCTCA ACACTGTTTC TTTTTCA 848 



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 207 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Leu Glu Lys Cys Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala
1 5 10 15

Glu Met Lys Leu Lys Gly Lys Pro Asp Gly Ser Phe Leu Val Arg Asp
20 25 30

Ser Ser Asp Pro Arg Tyr Ile Leu Ser Leu Ser Phe Arg Ser Gln Gly
35 40 45

Ile Thr His His Thr Arg Met Glu His Tyr Arg Gly Thr Phe Ser Leu
50 55 60

Trp Cys His Pro Lys Phe Glu Asp Arg Cys Gln Ser Val Val Glu Phe
65 70 75 80

Ile Lys Arg Ala Ile Met His Ser Lys Asn Gly Lys Phe Leu Tyr Phe
85 90 95

Leu Arg Ser Arg Val Pro Gly Leu Pro Pro Thr Pro Val Gln Leu Leu
100 105 110

Tyr Pro Val Ser Arg Phe Ser Asn Val Lys Ser Leu Gln His Leu Cys
115 120 125

Arg Phe Arg Ile Arg Gln Leu Val Arg Ile Asp His Ile Pro Asp Leu
130 135 140

Pro Leu Pro Lys Pro Leu Ile Ser Tyr Ile Arg Lys Phe Tyr Tyr Tyr
145 150 155 160

Asp Pro Gln Glu Glu Val Tyr Leu Ser Leu Lys Glu Ala Gln Arg Gln
165 170 175

Phe Pro Asn Arg Ser Lys Arg Trp Asn Pro Pro Arg Ser Glu Gly Leu
180 185 190

Pro Ala Gly His His Gln Gly His Leu Val Ala Lys Leu Gln Leu
195 200 205



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
GTTCCAAGCC TAACCCATCT TTGTCGTTTG GAAATTCGGG CCAGTCTAAA AGCAGAGCAC
CTTCACTCTG ACATTTTCAT CCATCAGTTG CCACTTCCCA GAAGTCTGCA GAACTATTTG 
CTCTATGAAG AGGTTTTAAG AATGAATGAG ATTCTAGAAC CAGCAGCTAA TCAGGATGGA 
GAAACCAGCA AGGCCACCTG ACACAGGTCC TTTAATTCTG TTTAGTCACA AAAGACGGCT 
TGTGTGACTG TTTGGATTTG GTGATCAAAT GTCCATGTTT ACAGTTGCTT TTCCCAGTTT 
GTGTCTTTCC CAATATTGTG AACCTTATCC ATCTTGCCTT ACTCAGTTTT ATTTCTAGTG 
CACTTTGTTG TGTATTATTT GTTTACCTGA CCATTTTCTA CTTTATTCTG CTAATAAACT 
GTAATTCTGA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 
(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
GGGGATCGAA AGCGGGGGCT TCTGGGACGC AGCTCTGGAG ACGCGGCCTC GGACCAGCCA 
TTTCGGTGTA GAAGTGGCAG CACGGCAGAC TGGTCAAACA AATGGATTTT ACAGAGGCTT 
ACGCGGACAC GTGCTCTACA GTTGGACTTG CTGCCAGGGA AGGCAATGTT AAAGTCTTAA 
GGAAACTGCT CAAAAAGGGC CGAAGTGTCG ATGTTGCTGA TAACAGGGGA TGGATGCCAA 
TTCATGAAGC AGCTTATCAC AACTCTGTAG AATGTTTGCA AATGTTAATT AATGCAGATT 
CATCTGAAAA CTACATTAAG ATGAAGACCT TTGAAGGTTT CTGTGCTTTG CATCTCGCTG 
CAAGTCAAGG ACATTGGAAA ATCGTACAGA TTCTTTTAGA AGCTGGGGCA GATCCTAATG 
CAACTACTTT AGAAGAAACG ACACCATTGT TTTTAGCTGT TGAAAATGGA CAGATAGATG
TGTTAAGOCT GTTGCTTCAA CACGGAGCAA ATGTTAATGG ATCCCATTCT ATGTGTGGAT 540

GGAACTCCTT GCACCAGGCT TCTTTTCAGG AAAATGCTGA GATCATAAAA TTGCTTCTTA 600 

GAAAAGGAGC AAACAAGGAA TGCCAGGATG ACTTTGGAAT CACACCTTTA TTTGTGGCTG 660 

CTCAGTATGG CCAAGCTAGA AAGCTTTGAA GCATACTTAT TTCATCCGGG TGCAAATGTC 720 

AATTGTCAAG CCTTGGACAA AGCTACC 747 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1018 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CACAAATGGG ACCATACAAA AATCTTGGAC TTGTTAATAA CCACTTACTA ACCGGGACCT 60 

GTGACACTGG GCTAAACAAA GTAAGTCCCT GTTTACTCAG CAGTGTTTGG GGGACATGAA 120 

GGATTeeCTA GAAATATTAC TCCGGAATGG TCTACAGCCC AGACGCCCAG GCGTGCCTTG 180 

TTTTTGGATT CAGTTCTCCT GTGTGCATGG CTTTCCAAAA GGAGGTGGAG CTGTAGTTCT 240 

TTGGAATTGT GAACATTCTT TTGAAATATG GAGCCCAGAT AAATGAACTT CATTTGGCAT 300 

ACTGCCTGAA GTACGAGAAG TTTTCGATAT TTCGCTACTT TTTGAGGAAA GGTTGCTCAT 360 

TGGGACCATG GAACCATATA TATGAATTTG TAAATCATGC AATTAAAGCA CAAGCAAAAT 420 

ATAAGGAGTG GTTGCCACAT CTTCTGGTTG CTGGATTTGA CCCACTGATT CTACTGTCCA 480 

ATTCTTGGAT TGACTCAGTC AGCATTGACA CCCTTATCTT CACTTTGGAG TTTACTAATT 540

GGAAGACACT TGCACCAGCT GTTGAAAGGA TGCTCTCTGC TCGTGCCTCA AACGCTTOGA 600 

TTCTACAGCA ACATATTGCC CACTGTTCCA TCCCTGACCC ATCTTTGTCO TTTGGAAATT 660

CGGTCCAGTC TAAAATCAGA ACGTCTACGG TCTGACAGTT ATATTAGTCA GCTGCCACTT 720

CCCAGAAGCC TACATAATTA TTTGCTCTAT GAAGACGTTC TGAGGATGTA TGAAGTTCCA 780 

GAACTGGCAG CTATTCAAGA TGGATAAATC AGTGAAACTA CTTAACACAG CTAATTTTTT 840 

TCTCTGAAAA ATCATCGAGA CAAAAGAGCC ACAGAGTACA AGTTTTTATG ATTTTATAGT 900 

CAAAAGATGA TTATTGATTG TCAGATAGGT TAGGTTTTGG GGGGCCAGTA GTTCAGTGAG 960 

AATGTTTATG TTTACAACTA GCCTTCCCAG TAAAAAAAAA AAAAAAAAAA AAAAAAAA 1018 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1897 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

CGGGGGGCTG GGACCTGGGG CGTAACCGTC TCTACCACGA CGGCAAGAAC CAGCCAAGTA 60

AAACATACCC AGCCTTTCTG GAGCCGGACG AGACATTCAT TGTCCCTGAC TCCTTTTTCG 120

TGGCCCTGGA CATGRATGAT GGGACCTTAA GTTTCATCGT GGATGGACAG TACATGGGAG 180 

TGGCTTTCCG GGGACTCAAG GGTAAAAAGC TGTATCCTGT AGTGAGTGCC GTCTGGGGCC 240 

ACTGTGAGAT CCGCATGCGC TACTTGAACG GACTTGATCC TGAGCCCCTG CCACTCATGG 300 

ACCTGTGCCG GCGTTCGGTG CGCCTAGCGC TGGGAAAAGA GCGCCTGGGT GCCATCCCCG 360 

CTCTGCCGCT ACCTGCCTCC CTCAAAGCCT ACCTCCTCTA CCAGTGATCC ACATCCCAGG 420 

ACCGCCATAC GACAGCCATC TGGTGCCAAR TCACTGAGCC CGTTGGGGTC CGCCGACCCC 480 

TGCGCCTGGG ATGGAAGCCC ACCTCAGCCA TGGGCAGACG TGCCCCCTCA TCCTACCGGC 540

TGCCTCTGCT GGGGGAACCT ATGCCAACGG ACTTCTCCCT TCCCAACACT GGCTGAAGCA 600

GCAGCACCCA GGCCCTTCCC TGAACCAGAT GCAGAGAATA AACTATGAAA ACCTCTCTCA 660 

GGCGCCTTCT GCTCTCAGGT GGAGTGGGCT GCCCCCCACT CTCTGCAGAG AGAGGCTACA 720 

CCCACCTGGG GGGTCCTGGG AGGTAAGACT AGTAGGAGGT GCCAGGGCTG ARTCCAAAAG 780 

CAGGAATGGC CAGGAMCAGG CCATACAGAT GAAGCTCAGG ATGTCACATA CCATGGACAM 840 

TGAGACAGAA CCCCAGGTTG GAMTTCCCTT GGGCCAACGA GTGCCAGCTT TAATGTCAGC 900 

TGCMGGTGCT CTGTGGCCTG TATTTATTCT TTAAACAGTA GCAAAGGCCA TTTATTTATT 960 

CCACTTAGAA AGGAAACCTT GGTGGGTGGY TTCCCTCGAT GTGCTTTCCC CCACCTCCCT 1020 

GGAATGTGTG TGCCACACCT GTCCTTGTCC CAGGCCAGGA CTGTGGCACA TGAGCTGGTG 1080 

TGCACAGATA CACGTATGTC GTCGTGCAXG ACCCCTGACT AGTTCCTAAG TAGCCCTGCA 1140 

CCAAGCACCA GAGCAGACCC CAAGAGAGGC CCGTGCAAGT CCCCATGTCC CCAGGTCCCT 1200 

GCTTCTGTTG CCTTGGGACT CATACACCGG CACACGTGTT TCAGCCTCTT GACTTCCATG 1260 

AGCTTCGAAT TTTGCCCCCG ATTCTTCTGA TATTTCCCAT TGGCATCCTC CAAAGCTCTG 1320 

GGCCTGGAGG GCATTAGGAC ACATGGAATG AGTGGGGTCT CCAGCCCCTG GGAAAGCCAC 1380 

TGGCAAGGCA GGATTAGAAA GACCAAGAGC AGGGTGGGGC GCCATGAAGC CTGTATGCCT 1440 

CTCAGGCTCA AGACCCCGCC ACACACCCAC TCAAGCCTCA GAAGTGGTGT GTAGGGCAGC 1500 

CCCAGGAGAG GAATGCCTGT CCTAGCAGCA CGTACATGGA GCACCCCACA TGTGCTCCAG 1560 

CCCTCTGGCT GTTTCTCTTG CTCTAGAATC AACTCCCTAC ATTGGGAATG TAGCCATTTG 1620 

GTAGAGGACT TGCCTAGCCT GCAGGAAGCT CACGTTCCAT CCCCTGCACC AAGGAGAATC 1680 

AAAGCTCAGG AGGCTGAGGC AGGAGGATTG CTGTCAGTGG TGTACAGAGG TCATGGCCAT 1740 

CCTGGGCTAT ATTAAACCTT GTCCTTTAAG AAAAAGAAAA GAAATCAACT TCCATTGAAT 1800 

CTGAGTTCTG CTCATTTCTG CACAGGTACA ATAGATGACT TKATTTGTTG AAAAATGKTT 1860 

AATATATTTA CMTATATATA TATTTGTAAG AAGCATT 1897

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

Gly Gly Trp Asp Leu Gly Arg Asn Arg Leu Tyr His Asp Gly Lys Asn
1 5 10 15

Gln Pro Ser Lys Thr Tyr Pro Ala Phe Leu Glu Pro Asp Glu Thr Phe
20 25 30

Ile Val Pro Asp Ser Phe Phe Val Ala Leu Asp Met Xaa Asp Gly Thr
35 40 45

Leu Ser Phe Ile Val Asp Gly Gln Tyr Met Gly Val Ala Phe Arg Gly
50 55 60

Leu Lys Gly Lys Lys Leu Tyr Pro Val Val Ser Ala Val Trp Gly His
65 70 75 80

Cys Glu Ile Arg Met Arg Tyr Leu Asn Gly Leu Asp Pro Glu Pro Leu
85 90 95

Pro Leu Met Asp Leu Cys Arg Arg Ser Val Arg Leu Ala Leu Gly Lys
100 105 110

Glu Arg Leu Gly Ala Ile Pro Ala Leu Pro Leu Pro Ala Ser Leu Lys
115 120 125

Ala Tyr Leu Leu Tyr Gln
130



(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AAGGGTAAAA AACTGTATCC TGTAGTGAGT GCCGTCTGGG GCCACTGTAG ATCCGAATGC 
GCTACTTGAA CGGACTCGAT CCCGAGACTG CCGCTCATGG ATTTGTGCCG TCGCTCGGTG 
CGCCTGGCCC TGGGGAGGGA GCGCCTGGGG GAGAACCACA CCTGCCGCTG CCGGCTTCCC 
TCAAGGCCTA CCTCCTCTAC CAGTGACGTT CGCCATCATA CCGCCAGCGC GACAGCCACC 
TGGTGCCAAC TCACTGAGCC GCCTG 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2438 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

AAGTGGCGGC GGTCCCTGGA GAGCAGGCGG AGGCAGCGGC AAGTCTGACT CTGCGCTGAC 60 

CGTGGAGCCG GGGCGGGGGC TGACAGCCAG GCCTCCGCCT GGCGGGAGCC GCACGAGGAG 120 

CGGGAGTGGC CGGGCCTCTC TTCCGCGCTT GAGCGAGCGC CGGGTGATGG CGGTGGTGAT 180 

GGCGGCAGGC GCTCGGACAG CTCCGCTTGA GCTGAGCTCG GAGAGATCCG TCCAGAAAGT 240

GCCCAGAAGA AACTTCCTCT TAGAAAAGCT GAAAAACACA RTATTTATAA CACTGGAAAT 300

TGTAAAGAAT TTGTTTAAAA TGGCTGAAAA CAATAGTAAA AATGTAGATG TACGGCCTAA 360 

AACAAGTCGG AGTCGAAGTG CTGACAGGAA GGATGGTTAT GTGTGGAGTG GAAAGAAGTT 420 

GTCTTGGTCC AAAAAGAGTG AGAGTTGTTC TGAATCTGAA GCCATAGGTA CTGTTGAGAA 480 

TGTTGAAATT CCTCTAAGAA GCCAAGAAAG GCAGCTTAGC TGTTCGTCCA TTGAGTTGGA 540 

CTTAGATCAT TCCTGTGGGC ATAGATTTTT AGGCCGATCC CTTAAACAGA AACTGCAAGA 600 

TGCGGTGGGG CAGTGTTTTC CAATAAAGAA TTGTAGTGGC CGACACTCTC CAGGGCTTCC 660 

ATCTAAAAGA AAGATTCATA TCAGTGAACT CATGTTAGAT AAGTGCCCTT TCCCACCTCG 720 

CTCAGATTTA GCCTTTAGGT GGCATTTTAT TAAACGACAC ACTGTTCCTA TGAGTCCCAA 780 

CTCAGATGAA TGGGTGAGTG CAGACCTGTC TGAGAGGAAA CTGAGAGATG CTCAGCTGAA 840 

ACGAAGAAAC ACAGAAGATG ACATACCCTG TTTCTCACAT ACCAATGGCC AGCCTTGTGT 900 

CATAACTGCC AACAGTGCTT CGTGTACAGG TGGTCACATA ACTGGTTCTA TGATGAACTT 960 

GGTCACAAAC AACAGCATAG AAGACAGTGA CATGGATTCA GAGGATGAAA TTATAACGCT 1020 

GTGCACAAGC TCCAGAAAAA GGAATAAGCC CAGGTGGGAA ATGGAACAGG AGATCCTGCA 1080 

GTTGGKGGCA CCTCCTAAGT TCCACACCCA GATCGACTAC GTCCACTGCC TTGTTCCAGA 1140 

CCTCCTTCAG ATCAGTAACA ATCCGTGCTA CTGGGGTGTC ATGGACAAAT ATGCAGCCGA 1200

AGCTCTGCTG GAAGGAAAGC CAGAGGGCAC CTTTTTACTT CGAGATTCAG CGCAGGAAGA 1260 

TTATTTATTC TCTGTTAGTT TTAGACGCTA CAGTCGTTCT CTTCATGCTA GAATTGAGCA 1320 

GTGGAATCAT AACTTTAGCT TTGATGCCCA TGATCCTTGT GTCTTCCATT CTCCTGATAT 1380 

TACTGGGCTC CTGGAACACT ATAAGGACCC CAGTGCCTGT ATGTTCTTTG AGCCGCTCTT 1440 

GTCCACTCCC TTAATCCGGA CGTTCCCCTT TTCCTTGCAG CATATTTGCA GAACGGTTAT 1500 

TTGTAATTGT ACGACTTACG ATGGCATCGA TGCCCTTCCC ATTCCTTCGC CTATGAAATT 1560 

GTATCTGAAG GAATACCATT ATAAATCAAA AGTTAGGTTA CTCAGGATTG ATGTGCCAGA 1620

GCAGCAGTGA TGCGCAGAGG TTAGAATGTC GACCTGCATA CATATTTTCA TTTAATATTT

TATTTTTCTT ATGCCTCTTT GAATTTTTGT ACAAAGGCAG TTGAATCAAA TAAAACTGTG

CCCTAAGTTT TAATT CAGA TCAATTTATT TTTTTTATGA TACACTTGTT ATATATTTTT

AAGCAGGTGT TTGGT TTGT TTTTACCATA TAAATTTACA TATGGTCCAG GCATATTTAC

AATTTCAAGG CATTGCATAT ACATTTGAAT ATTCTGTATT TTTTAAATAA TCTTTTGTTC

TTTCCTATGT GTGAAATATT TTGCTAATCT ATGCTATCAG TATTCTTGTA TGACCGAATA

GTTACCTATT CTCTTT7CAT CTTGAAGATT TTCAGTAAAG AGTGTTGTAA TCAATCCATT

ATAATGTAAT TOACTTVTGT AATTTGCCAA TAGGAGTGTT AAACAACAAA ATGATTTAAA

ATGAAACTTA ATGTAT!TTC ATTTTAAATA TTAACTAAAC CAAGTTTGTT TGTTAGTTAT

TCTAGCCAAT AAGAAAfGAG AATGTAGCAT CCTAGAGGTG TATTTGTTCT GCAGTTTGGC

AGGACCGTCA GTTAGTCCAA ATAAACATCC CCTCAGCGTG GAGGCGAATG GAACCTGTGC

TCCTTTCTTA CGGGAAGCTT TGCAAAGCAA AATAGCAGGG TTACAAGCTT GGAGTTGTTA

AGGCAACTAG AGTTTTCTCT ATTAATTTAT AGACTGTTGT TGCACCTACT TAGCTCTTTT

TTGGGAACTC TAGTTCCCAG GGGAAAATAC CTCGTGCC



(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 542 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Ser Gly Gly Gly Pro Trp Arg Ala Gly Gly Gly Ser Gly Lys Ser Asp
1 5 10 15

Ser Ala Leu Thr Val Glu Pro Gly Arg Gly Leu Thr Ala Arg Pro Pro
20 25 30

Pro Gly Gly Ser Arg Thr Arg Ser Gly Ser Gly Arg Ala Ser Leu Pro
35 40 45

Arg Leu Ser Glu Arg Arg Val Met Ala Val Val Met Ala Ala Gly Ala
50 55 60

Arg Thr Ala Pro Leu Glu Leu Ser Ser Glu Arg Ser Val Gln Lys Val
65 70 75 80

Pro Arg Arg Asn Phe Leu Leu Glu Lys Leu Lys Asn Thr Xaa Phe Ile
85 90 95

Thr Leu Glu Ile Val Lys Asn Leu Phe Lys Met Ala Glu Asn Asn Ser
100 105 110

Lys Asn Val Asp Val Arg Pro Lys Thr Ser Arg Ser Arg Ser Ala Asp
115 120 125

Arg Lys Asp Gly Tyr Val Trp Ser Gly Lys Lys Leu Ser Trp Ser Lys
130 135 140

Lys Ser Glu Ser Cys Ser Glu Ser Glu Ala Ile Gly Thr Val Glu Asn
145 150 155 160

Val Glu Ile Pro Leu Arg Ser Gln Glu Arg Gln Leu Ser Cys Ser Ser
165 170 175

Ile Glu Leu Asp Leu Asp His Ser Cys Gly His Arg Phe Leu Gly Arg
180 185 190

Ser Leu Lys Gln Lys Leu Gln Asp Ala Val Gly Gln Cys Phe Pro Ile
195 200 205

Lys Asn Cys Ser Gly Arg His Ser Pro Gly Leu Pro Ser Lys Arg Lys
210 215 220

Ile His Ile Ser Glu Leu Met Leu Asp Lys Cys Pro Phe Pro Pro Arg
225 230 235 240

Ser Asp Leu Ala Phe Arg Trp His Phe Ile Lys Arg His Thr Val Pro
245 250 255

Met Ser Pro Asn Ser Asp Glu Trp Val Ser Ala Asp Leu Ser Glu Arg
260 265 270

Lys Leu Arg Asp Ala Gln Leu Lys Arg Arg Asn Thr Glu Asp Asp Ile
275 280 285

Pro Cys Phe Ser His Thr Asn Gly Gln Pro Cys Val Ile Thr Ala Asn
290 295 300

Ser Ala Ser Cys Thr Gly Gly His Ile Thr Gly Ser Met Met Asn Leu
305 310 315 320



Val Thr Asn Asn Ser Ile Glu Asp Ser Asp Met Asp Ser Glu Asp Glu
325 330 335

Ile Ile Thr Leu Cys Thr Ser Ser Arg Lys Arg Asn Lys Pro Arg Trp
340 345 350

Glu Met Glu Glu Glu Ile Leu Gln Leu Glu Ala Pro Pro Lys Phe His
355 360 365

Thr Gln Ile Asp Tyr Val His Cys Leu Val Pro Asp Leu Leu Gln Ile
370 375 380

Ser Asn Asn Pro Cys Tyr Trp Gly Val Met Asp Lys Tyr Ala Ala Glu
385 390 395 400

Ala Leu Leu Glu Gly Lys Pro Glu Gly Thr Phe Leu Leu Arg Asp Ser
405 410 415

Ala Gln Glu Asp Tyr Leu Phe Ser Val Ser Phe Arg Arg Tyr Ser Arg
420 425 430

Ser Leu His Ala Arg Ile Glu Gln Trp Asn His Asn Phe Ser Phe Asp
435 440 445

Ala His Asp Pro Cys Val Phe His Ser Pro Asp Ile Thr Gly Leu Leu
450 455 460

Glu His Tyr Lys Asp Pro Ser Ala Cys Met Phe Phe Glu Pro Leu Leu
465 470 475 480

Ser Thr Pro Leu Ile Arg Thr Phe Pro Phe Ser Leu Gln His Ile Cys
485 490 495

Arg Thr Val Ile Cys Asn Cys Thr Thr Tyr Asp Gly Ile Asp Ala Leu
500 505 510

Pro Ile Pro Ser Pro Met Lys Leu Tyr Leu Lys Glu Tyr His Tyr Lys
515 520 525

Ser Lys Val Arg Leu Leu Arg Ile Asp Val Pro Glu Gln Gln
530 535 540



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4599 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:



CCCTCTGGGC AAGCCGCCCC CCCCCCACCC ATCTACCACA CACACACACA CACACACACA 60 

CACACATTCA G^CCTTGGGG CAAAAACAAA GCAAAATAAC AACAACAAAA ACACTGCCTG 120 

TGGAAAGTCC TTACTTCAGG AAGGTTGGCA GATGAGGAGC AAGGGAACAT TTTATCAGGA 180 

CTGCCACAAA GOAGTCTTTT TTTTTAATGG TTTTTCAAGA CAGGGTTTCT CTGTATAGCC 240 

CTGGCTGTCC TCGAGCTCAC TTTGTAGACC AGGCTGGCCT CGAACTCAGA AATTCGCCTG 300 

CCTCTGCCTC C]*GAGTGCTG GGATTAAAGG CGTGCAGCAC CATGTCCAAC TGGCATTTTC 360 

TCAATTAAGG TTCGTTCCTT TCAGATAACT CTAGGTTCTG GGTCAAGCTG ACACAAGGCT 420 

ACACAGCACA GmTGTATGC CACATTCAGT TCAGAAGACA CCCAACCTCC CTGGAACTGG 480 

AACTTATGCA CATTTGTGAG CTTCCACTTG GGAGTGGGAA CCTGAACTGG GTCCTCTGCA 540 

AGAGCAGCCG TGCTCTTAAC TGCTGAGCCA TTTCAGCAGC CTCACATCAG AATTAAGTTA 600

GAAATTAGCCG GGTATGAATC ATACCCTTAG AATCCTAGCA TCTGAAAGCA GAGCTAAGAG 660

AAACAGGGAil TCAAGACCAG CTCTTGGCTA CAGAGCCCGT CCTGTCCTAG GATGGGCTAC 720 

AAGAGACTAT WtCAAAGCCA TCCAAACAAC AATAACTACA ACAACAACAA GGTTAAAATT 780 

AGGCTGGGCA CAGGGTACAC ACCTTTAATG CCAACACTCA GGAGGCAGAG GCAGGCTGAT 840

CAGTGTGAGT TOGAGTTCAA CGTGGTCTAC ATAGGGAGTT CTAGGCCAGC AGAGGTTACA 900 

GTCTCTCTCT cicTCTCTCT CTCTCTCTCT CTCTCACACA CACACACACA CACACACACA 960 

CACACACACA CACACACGGT GGCATTATGG GATTTTTTTG GGATAAGGTT TCTCTGTCTA 1020 

GCCCTGGCAT AGATTCACTC TGTAGACTAG GCTAGCCTTG AACTCAGAGA TCCGCCTGCC 1080 

TCTGCCTCCC AAGTGCTGGG ATTATAGGTG TTGCACCACC ACTGCCCAGC CACTTTGGGA 1140

TTTTTGAACT GTTATCAAGA GGCTTTCGAG GAGGTCAAAC TTCAACAGCA ACCTCTCCAT 1200 

GATAATGTAG CTAATGATCA AACGACACTC AAAACTTAAC CCTTAAAGCA CACATCCACC 1260 

AGACAGCGTG CCCACTCGTA GTTCCATTAC TCAGGAGGCT GAAGCAGGAG GATGAAGGAC 1320 

TAAGGCTTCA GCAAeCTAGG GAGCCGCAGG GGACAGTAGT CTCAATCCCT ACATTCTCCT 1380 

GAACACAGGA GCAGckGTTC AGGAAGGGTG TCAAGGCCGC TTACTGATCT TAGGGCCTCA 1440 

GGAATGACTA GCTCAGGCAG AGAGAGCAAA GGTCTCCAGT GGAGAAGTCT ACACACACAC 1500 

ACACACACAC ACACAQACAC ACACACACAC AGAATCCAAG GCGATGACGT CATCAAAGGG 1560 

TTAATTCTAG TCTGGGATGG GGGGGAGGGT GGGGCACGCA GCTGTCAGGT GGCTTTGGAA 1620 



AAATAAACTG CTGAAG^GTC TGACGCCAGG GAGTCCTGGG AGGGACAAGA GGTTACCCAC 1680

TCAAAGAGTG TGCTCCJCAA AGCATGCGCG CTTGTCCACG TCTGGAGTCG TCACTTATTT 1740

TTTGCCTGGA TTCTTTCTAG CCGGTGGGTT CTCAAGGCGG TAAGTGGTGT GGCCGCCGTG 1800

GTCTGGGAGG TGACGA1AGG GTTAATCGTC CACAGAGCCC AGGGGCGGAG CGCGGGCGGG 1860 

CGTCCGCAGC CCCGCTCGAG CCGGAAGCAG TGGCTGGTCA GGGGCGCTTC TAGCCTTCCC 1920 

TATCTGTACT TCCACAgWgG TCTCTGCGAG CTAGGGGGAC AGTGAGGTGC GGGGTAGGGG 1980 
CCCGGCGTTA AaGCCAGCAA GGGGACGGTT CACGGTAAGG TCTGAGGGAG AGAGAGCTCC 2040

TGAGAAACTT GGGGGGCGCG ACACAGATAG GGTGAAAGCA GAGTGATAGA CCTGGGATGG 2100 

TTAGGGCACC aIgGGAAGAC CAGGCTGGTT GGCATACACC GGTGAACGGA TGGGAGTCCT 2160 

AGGGAAAGAT GATGCGCCTA ACAGTCCTTT CTGTCTCCAC ACCACTCCAG GGGACGATCC 2220 

GGAGCTCAAC TTTCAAAAGC GAGACGCCCC AGCAAGCCTG TTTTGAGAAG TTCTTCAGCG 2280

GCTCTCCTCA TGGGCCAGAC GGCCCTGGCA AGGGGCAGCA GCAGCACCCC TACCTCGCAG 2340 

GCTCTGTACT CGGACTTCTC TCCTCCCGAG GGCTTGGAGG AGCTCCTGTC TGCTCCCCCT 2400 

CCTGACCTGG TTGCOCAACG GCACCACGGC TGGAACCCCA AGGATTGCTC CGAGAACATC 2460

GATGTCAAGG AAGGGsGTCT GTGCTTTGAG CGGCGCCCTG TGGCCCAGAG CACTGATGGA 2520 

GTCCGGGGGA AACGGGGCTA TTCGAGAGGT CTGCACGCCT GGGAGATCAG CTGGCCCCTG 2580 

GAGCAAAGGG GCACACjACGC CGTGGTGGGC GTGGCCACCG CCCTCGCCCC GCTGCAGGCT 2640 

GACCACTATG CGGCGuTTTT GGGCAGCAAC AGCGAGTCCT GGGGCTGGGA TATTGGGCGG 2700 

GGAAAATTGT ATCATCAGAG TAAGGGCCTC GAGGCCCCCC AGTATCCAGC TGGACCTCAG 2760 

GGTGAGCAGC TAGTGGTGCC AGAGAGACTG CTGGTGGTTC TGGACATGGA GGAGGGGACT 2820 

CTTGGCPACT CTATTGGGGG CACGTACCTG GGACCAGCCT TCCGTGGACT GAAGGGGAGG 2880 

ACCCTCTATC CCTCTGTAAG TGCTGTTTGG GGCCAGTGCC AGGTCCGCAT CCGCTACATG 2940 

GGCGAAAGAA GAGGTGAGAT ACGGACTAGG TGTGGGGAGA TCACTACTCT TGGCAATGGT 3000 

TTGGGCTGGA AACTCATOGT TGGAGCACAG GAAGTAGGCT TCTTGTCACT TTGGCCTGTC 3060 

ACTTAGATGG CCTTGGATCT AGCTTCACTC CCAATCCCTA TTGGATGTGA TGCACAAATT 3120 

CAGAGCCTTT GGGTCTCCGT CAGCTGAGGT GGCGGTGGAA ATGGAGGAAG AAGGAAGGGT 3180 

GCCTGAGCAG GATCTCAAGTy TCAAGGATGC CTGGAGTTGC TTACTTACCT TGTCTTCCTT 3240 

CTCTCTCCGC AGTGGAGGAA CCACAATCCC TTCTGCACCT GAGCCGCCTG TGTGTGCGCC 3300 

ATGCTCTGGG GGACACCCGG CTGGGTCAAA TATCCACTCT GCCTTTGCCC CCTGCCATGA 3360 






AGCGCTATCT GCTCTTACAAA TGACCCAGTA GTACAGGGTG TGCTGGCACC CTACCGTGGG 3420 
GACAGGTGGA GAGGOACCCG CTGGCCTAGA CAACTTTAAA AAGCTGGTGA AGCTGGGGGG 3480 

GGGGGGCTGG ACCCCtTTCAC CTCCCCTTCT CACAGGAGCA AGACATATAG AAATGATATT 3540 

AAACACCATG GCAGCCTTGGG ACAAAGAGGT TTTTGAAGTA AAAAATGAGA TGTATTGTCA 3600 

CAACCTGTTT CATTATjTGTT TTXTGTTTTG TTTTACACTC CCCCACCCCA GGCTAGAGCC 3660 

CCATCACTGT CTTAAGGAAT TATGACAACC CACAAAGCTC AGGCCCAGGT GTTTATTTCC 3720 

CTTACATGTA GGATGGttTCA CAAACACAAT ACAGGGGCTT TGGCACCGTG GGGGAGGGGA 3780 

CTATCCCAGG CCTCTTAGGG TCTCATGTAT ACCGAATTCA GACCCGAAAG CTCTGAATTT 3840 

CTGCATCAGA CATCCAGTAG AACTTGGGAG TGAAGCTAGA GCCAAGGCCA TCTAAGTGAC 3900 

AGGCCAAAGT GACACGAAGC CCACTTCCTG TGCTCCAACC ATGAGTTTCC AGCCCAAACC 3960 

AATGGAAGGT GATTTCACTT GTCAGGGCCC AAAGGGACAG TCAGTTCTAC TCCCTCCCCT 4020 

CACTAGGAGC CACCTTGpTG ACAGTTGATT CTACCCACTG TAAGTGGTAA AGGGATTGGC 4080 

CTGGTCCCAA CCATAATAGG GCGGTGGAAA CGGCTCAGGA GGGTACAGCG TGGATTAGGC 4140 

CACAAGATGG GGCAGATGAT GTCATCAGAA GCATGTGACC GGTGGGAGCA GTTACTAAAC 4200 

TTCTGGGCAA CCTAGTCCAT GCTATGCAGG CAGGTAGAGG GATGGGCAGT GCTCATTGTT 4260 



TGGCATTGAT GATGTCCACA AATTCAGGCT TGAGAGATGC GCCACCCACA AGGAAGCCGT 4320 

CCACGTCAGG CTGGCTTSCC AGCTCTTTGC AGGTTGCTCC AGTCACAGAA CCTGTACCAG 4380 

GAACAAGAAG ACAGTTTGGT CAGGTCTATG ATCAGAACAC TTAAGCCCCA CCTCTCTGTG 4440 

CAAGGCAGCC TCAGTCTGTC TTAGCCCATT TCCGTCTTAG CTAGAGCCAA AGCCACTCAC 4500 

CTCCATAAAT GATCCGGGTG CTCTGAGCCA CCCCATCATT GACATTGGAT TTCAGCCATC 4560 

CCCGGAGCTT CTCGTG1ACT TCCTGTGCCT AGAAGGAGGA GGCAGAGCTA CTAAGTAAGC 4620 



TCCTTCCTAT CTATCAT^CA AGGAGTAAAA ACCACTGGTT CTCACATAGA GTTGAGTTTC 4680 
CAGAAAAGCC CCGGGACCAG AGAGTGGCAA GGCTCCAATC CCACCAGGCT TGGAATGAAC 4740 
ATTTTTGGCA AAGTCACTCT CCTTGGTGAG TTTGGGGGCC CTCTGTCTCT AAAGGGGCTT 4800

GGATGGGCTCl CATAGCTGTG TGAGTCTGTT AAAGCCGGAC AGGCTGAGGA GCTCTGGGTA 4860

GTTACCTGCT IGAGGGGTTGC CGTCTTGCCA GTCCCAATGG CCCACACAGG TTCATAGGCC 4920 

AGGACCACCT TGCTCCAGTC TTTCACATTA TCTGTGGGGC AGAGAGGAGA GTGAGTAGGA 4980 



AGGAGCTGAC CCGCCAAGC 4999



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Gly Gln Thr Ala Leu Ala Arg Gly Ser Ser Ser Thr Pro Thr Ser
1 5 10 15

Gln Ala Leu Tyr Ser Asp Phe Ser Pro Pro Glu Gly Leu Glu Glu Leu
20 25 30

Leu Ser Ala Pro Pro Pro Asp Leu Val Ala Gln Arg His His Gly Trp
35 40 45

Asn Pro Lys Asp Cys Ser Glu Asn Ile Asp Val Lys Glu Gly Gly Leu
50 55 60

Cys Phe Glu Arg Arg Pro Val Ala Gln Ser Thr Asp Gly Val Arg Gly
65 70 75 80

Lys Arg Gly Tyr Ser Arg Gly Leu His Ala Trp Glu Ile Ser Trp Pro
85 90 95

Leu Glu Gln Arg Gly Thr His Ala Val Val Gly Val Ala Thr Ala Leu
100 105 110

Ala Pro Leu Gln Ala Asp His Tyr Ala Ala Leu Leu Gly Ser Asn Ser
115 120 125

Glu Ser Trp Gly Trp Asp Ile Gly Arg Gly Lys Leu Tyr His Gln Ser
130 135 140

Lys Gly Leu Glu Ala Pro Gln Tyr Pro Ala Gly Pro Gln Gly Glu Gln
145 150 155 160

Leu Val Val Pro Glu Arg Leu Leu Val Val Leu Asp Met Glu Glu Gly
165 170 175

Thr Leu Gly Tyr Ser Ile Gly Gly Thr Tyr Leu Gly Pro Ala Phe Arg
180 185 190

Gly Leu Lys Gly Arg Thr Leu Tyr Pro Ser Val Ser Ala Val Trp Gly
195 200 205

Gln Cys Gln Val Arg Ile Arg Tyr Met Gly Glu Arg Arg Val Glu Glu
210 215 220

Pro Gln Ser Leu Leu His Leu Ser Arg Leu Cys Val Arg His Ala Leu
225 230 235 240

Gly Asp Thr Arg Leu Gly Gln Ile Ser Thr Leu Pro Leu Pro Pro Ala
245 250 255

Met Lys Arg Tyr Leu Leu Tyr Lys
260

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5615 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

GTACTTTCTT TATATCTCCA TAATTTTATT TACTATTACT ACATGATACA TTATTTTATA 60



AAAGTCTTTG TAACCTCCTT AAGGATTCAC TGCTTAATCT CCAGTGCTTA GCACAAATCA 120 

TTAAATGCGA ACCAGAAACT CTTCCAAATG TGTTACATCT ATAACCTCAT TGGATTCTCA 180 

CTACCAACCC CATGCAATAG ATACTAATGT GATCTCTGTC TTACAGAGGA AGAAACAGGC 240

ACAGGGAGGT TCAGTAATTT GCCCAAGGTC ATACACACAC TGGCCTTCAG GTATTCATGC 300 

CCGGGGAGTC TGGTCCCACA GCTGGCATGT TTGCCATTA? ATTATATTGC CTCCTTATAG 360 

TGTCGGCACT CATTAAGCAC ATTGACAGCT ATGCTTGGTG AGTGACTACT ATGTACCCAG 420 



CTCTGTGCTA CATGCTTTAC CTGGATTATT TCAACTGCAC AACAACCCTG TGAGGTAACT 480

ACCATCATTG CTCCTATTTT ACATAACAGA AAACTACAGA AATCTGGGGC TGGGCGTAGT 540

GGCTCATGCC TGAAATCCCA GCACTTTGGG AGACCCTGTC TCTAAAAAAA 600

GGCCGGACGT GGTGGCTCAC ACCTGTAATC TCAGCACTTT GGGAGGCTAA GGCAGGCAGA 660

TCACAAGGTC AGGAGTTCTA GACCAGCCTG GCCAACATGG CAAAACCCTG TGTCTACTAA 720

AAATACAAAA AATAGCTAGG CGTGGTGGCA GGTGCCTGTA ATCCCAGCTA CTCAGGAGGC 780

TGAGGCAGGA GAATCCCCTG AACCTGGGAG ATGGAGGTTA CAGAGAGCCG AGATCGTGCC 840



GCTGCACTCC AGCCTGGGCA ACAAGAGCAA GACTCTGTCT CGAAAAAAAT AAAAATAAAA 900 

ATAAAAATAT TTTTTTAAAA ATTAGCTGGG TGTGGTAGCA CATGCCTGTA GTCCCAGCTA 960 

CTTGGGAGGC TGAGGTAGGA GGATCACTTG AGCCCAGGAG GTCAAGGCTG CAGTGGGCTG 1020 

TGATGGCGCC ACTGCACTCT AGCCTTGGTG ACAGCAAGAC CCTGTCTCAA AAAAAAAAAA 1080

AAGAGAAATC GGGCAACTTC CCCAAGATCG CGCAGTTAAC TAGTGGCATA GCTTCACTCA 1140

AACTCGAAGT CTTAATCAGG ACACTCTACC AAATGAGATC AACGGCTCAG TAATGGATTG 1200 

GCATCCAGTA TGAAGACTGG ACCAGCAGGG AGAACTATGA TGCGTACAGC CTAGAGCCTG 1260 

AAGCAGATTT CACAGCCTCA GAGGTGGCAC AGGCTGACTC ACAACCCGGG GCAGAAAGGG 1320 



ACCAGCCCAG AAACAGTGAC CCAGAATCAC AGGGAAGTAG AAATGGGATT CGGCACAATG 1380 
AAGCCCCTCC TTGACCCCflT GCTCCTTACC CTCAGGGGCG CAGGAGTTAG TCGCTCAGGC 1440



GGCTCAAAGG TCTTGACGGT GGAGAACACC ATCCCCAGGG ATTCCCGACG CGGTGATGCC 1500 
ATCAAAGCGT TAATTCTGAG ATGGGCCTGC CCGGGTGCGG ACTCTGCCGC AGCAAGAGAA 1560 



GGGTTAACTG CCCCGGGCCT TCGCCGTGGG GGCGGGGCCT CGGGGAGGGT CACAGCCCGG 1620

GACTGAGACC CGAGGTTAAC CGCCCGGGGT GGGCTCCACG GGGGCGGGGC ATGCTCTCCG 1680

CGGCTGCTGC CGGTATAGAG CGGTAACTGC CCAGGAGGGG GCGGGGCCCC ACAGGGGCGT 1740

GGCCTCGGAG CTGCACGGCC GTGGGCGGCG ATGAGAGGGT TAAGCCCCAG AGGGCCCTGG 1800

AGGGGCGGGG CCGCGGGACG GGCTCGGCCC AAGGGAGGAG CTGGGGGCGG AAGCGGCCGG 1860

CGGTCTGCGC CCTGCGCGCC TCGGCTTCTT TCCGCCCGGC TCCTTCAGAG GCCCGGCGAC 1920

CTCCAGGGCT GGGAAGTCAA CCGAGGTTCG GGGGCAGCGG CGAGGGCTCC GGGCGAGTAA 1980



GGGGGATGGT CCATGCTGAG GCCCAAATGG GGCGAACTCG CGAGAGTCTC TGGCGACCTG 2040 
GATCAGATGG GGCGAGGGCA GATGAAGGGC CCAGGAGCTT TGGCGCAGCG AGGAGGGAGG 2100 



AGCGGGCCCG TTGGCAAACT TGGGTGAAAG GATGGGGTAC CTGGGTGACG AGCCCCCGCC 2160

AGGATTCTGC TCTTCACGCC CCTTTTCTCC CAGCTCCCTT CCAGGTCAAT CCAAACTGGA 2220

GCTCAAGTTT CAGAAGAGA* AGACGCCCCA GCAAGCCTCT TTCGGGGAGT CCTCTAGCTC 2280



CTCACCTCCA TGGGCCAGAC AGCTCTGGCA GGGGGCAGCA GCAGCACCCC CACGCCACAG 2340

GCCCTGTACC CTGACCTCTC CTGTCCCGAG GGCTTGGAAG AGCTGCTGTC TGCACCCCCT 2400 

CCTGACCTGG GGGCCCAGO; GCGCCACGGT TGGAACCCCA AAGACTGTTC AGAGAACATC 2460 

GAGGTCAAGG AAGGAGGGT V GTACTTTGAG CGGCGGCCCG TGGCCCAGAG CACTGATGGG 2520 

GCCCGGGGTA AGAGGGGCT \ TTCAAGGGGC CTGCACGCCT GGGAGATCAG CTGGCCCCTA 2580 



GAGCAGAGGG GCACGCATG CGTGGTGGGC GTGGCCACGG CCCTCGCCCC GCTGCAGACT 2640

GACCACTACG CGGCGCTGCT GGGCAGCAAC AGCGAGTCGT GGGGCTGGGA CATCGGGCGG 2700
GGGAAGCTGT ACCATCAGAG CAAGGGGCCC GGAGCCCCCC AGTATCCAGC GGGAACTCAG 2760 
GGTGAGCAGC TGGAGGTGCC AGAGAGACTG CTGGTGGTTC TGGACATGGA GGAGGGAACT 2820

CTGGGCTACG CTATTGGGGG CACCTACCTG GGGCCAGCAT TCCGCGGACT GAAGGGCAGG 2880 

ACCCTCTATC CGSCAGTAAG CGCTGTCTGG GGCCAGTGCC AGGTCCGCAT CCGCTACCTG 2940 

GGCGAAAGGA GA3GTGAGGC CTGGGGCAGA CGTGGGGAGA ACTTTCTGTC CCTGGTGGCA 3000 

GTGGTTTGGG AT 3GAAACTC TTCTGACAAG AGCAGAGGGG ATGGACCTTC ATCCAGCCTG 3060 

CCTCAACCTC TG' ^TCAGTGC TGGGAAAGGC TAGGGGTCTT CACAGCTGTT ATTTAATTTA 3120 

ACCCAACAGC AA M AGAGGTG AAACAGGCTT GAGAAAGCAA CTTTCTCAAG TTCTCTTGGC 3180 

CAGTAAATGG TGfACCTTCA GAATGGAGGG AGGAACTGCA GGGATGAGAG AATTCAGGAG 3240 

ATATCAACCC CTC AGCAAGA GGTGCAAAGC GTTAGGTACT GGGTTTGATG TACAGGTCCA 3300 

AAAGAAGGAT GGC CAGAGCC AGGTACCCAG GCTGTATACC GGATTCCCTG GGCTCTAACC 3360 

TGTCTCTGTG CCfl CATACCT ACTTCCTTCC TCAGCCACAC CTCTGGATGG AGACACTGQG 3420 

GCCCTGGGCA CCP GGGAGGA GAGCAGTGGA GGAGGCAGGG CCTTAGGGTG GGGCAGCAGG 3480 

GGAGGAGCCT CCC CAGGAAC TGACTGGGTC CAGGGCTTGG AGCTGCTCTC TGCAGTTGTG 3540 

TGGGCTGTAG AG' 'GGAGGGC CATCCCTCCT CACCTCAGCC CCAGCTCCCA AGCCTCTGGA 3600 

GTCAAfteCCT GGCCCAGCTC CACCACTGTC AGAGCCACCT TGGCCTGTTG TTTAGAGGGC 3660 

CTTAGCCAGC TC'?TCACCCC CAGCTCTGAC TAGGGATGTG TGAAATCTTA TCTGGGAGGC 3720 

AGAACTTCCG GG'PATCTCAA ATTCCCCTTT CAGCCAGGTG GGCACACTCG AAGCAGGAAA 3780

GCAGAAAGGC AT^TGAGTAG GACCCCGTAG TTTGAGGACA TCTGGCTGGT GGCTGCACCC 3840 

ATACTTACAT TCOCCTCCTT CTCTCTCCCA GCGGAGCCAC ACTCCCTTCT GCACCTGAGC 3900 

CGCCTGTGTG TGCGCCACAA CCTGGGGGAT ACCCGGCTCG GCCAGGTGTC TGCCCTGCCC 3960 

TTGCCCCCTG CCATGAAGCG CTACCTGCTC TACCAGTGAG CCCTGTGATA CCACAGACTG 4020 

TGCTGAGGTC TTGCCACCAC CCCTCCCCTT GGGGAGGTGG GGAGGCACTG CTGGCCTAGA 4080 


CCAGCTGCTG AAAGCTGGTG AGGCTGAGCC CCTACCCCAA CCCAAGCTCT GCGGAAATCA 4140 
ACAGCCCCAG AQCCACTTGG AGGGAGGAAG AAAGGGAGCC GGCGTTCAAG GCTATGACAG 4200

TCTGCTACGC AAAACATTTT TTCAAGTAAA AATAGTAAGA GATGTTGTTA TAGAAACCTG 4260 

TTCTTGTTTT TTTTTTTTTC TTGCACAAAT GATCATTTAT ATAGCTGCCT CAAAAAGGAA 4320 

GATTATCTGG GCAAGTCCAG TGAAGGCAGA CAAACCACAA GACCTAGTGC CAGGTTTATT 4380 

CCCTCACATG GGTGCTTCAC ATACACAGCA CAGAGGCACG GGCACCATGG GAGAGGGCAG 4440 

CACTCCTGCC TTCTQAGGGG ATCTTGGCCT CACGGTGTAA GAAGGGAGAG GATGGTTTCT 4500 

CTTCTGCCCT CACTAGGGCC TAGGGAACCC AGGAGCAAAT CCCACCACGC CTTCCATCTC 4560 

TCAGCCAAGG AGAAGuCACC TTQQTGACGT TTAGTTCCAA CCATTATAGT AAGTGGAGAA 4620 

GGGATTGGCC TGGTCOCAAC CATTACAGGG TGAAGATATA AACAGTAAAG GAAGATACAG 4680 

TTTGGATGAG GCCACAGGAA GGAGCAGATG ACACCATCAG AAGCATATGC AGGGAAAGGG 4740 

CAGTTACTGG GCTTCTGGGC TGCTTAGTCC CTGGCTTGGC AGGAAGGGTA GGGAAGATGG 4800 

ATGGGGCTCA TTGTTTaGCA TTGATGATGT CCACGAATTC GGGCTTGAGG GAAGCACCAC 4860 

CCACAAGGAA GCCATCOACA TCAGGCTGGC TCGCCAGCTC CTTGCAGGTT GCCCCAGTCA 4920 

CAGAGCCTGG GAAGGGAECA GAACAAGGGC TTGGTCAAGA ATGGGATGAG TCTGCCCCAT 4980 

CCCCAGCTCC ATGTCCGAGG GCTCAGTCTA GTCCTCAGCC CACTCCACCT CAGCCGGGAA 5040

CCAAAGCCAC TCACCTCCAT AAATGATACG GGTGCTCTGA GCCACCGCAT CAGAGACGTT 5100 

GGACTTCAGC CATCCTCGGA GCTTCTCGTG TACTTCCTGG GCCTAGAACA AGAAGCTGGC 5160 

CTAAGTAAGA CCTTTTCTGC CTCTCTAAGA GGAAAAATCA CTGGCACCAG TGGACACTTA 5220 

GTGTGGTTTC TGACTGAGTC AGAGTACCAG GGCTCTGATC CAAGCCAGGC CCTGGACTGG 5280 

ATGCCCTTGG ACAAGTCACT GTCTCTGGGT TCAAGGTCTC TGTGTCTTTG AAATAAGGGG 5340 

TTGCCCCATG TGGGCTGTGT CTGTCCAAAC CTATTGAGGC AGGCTGGGAT GAGGGCAGGG 5400 

CTCCTGGGCC CGGTTACCTG TTGGGGTGTT GCAGTCTTGC CAGTACCAAT GGCCCACACA 5460 

GGCTCATAGG CCAGGACGAC CTTGCTCCAG TCCTTCACGT TATCTGCAGG GCAGAGATAC 5520 






AGATGGAGGG AaLgTGAAC AAGAAAGAGC TCTCCAGCCA GGTTCTCCGG AGTACGAAGA 5580 



ACGGTGGCCT ACTGCCCCCT AGTGGACATT GGQGG 5615

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 263 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Met Gly Gln Thr Ala Leu Ala Gly Gly Ser Ser Ser Thr Pro Thr Pro
1 5 10 15

Gln Ala Leu Tyr Pro Asp Leu Ser Cys Pro Glu Gly Leu Glu Glu Leu
20 25 30

Leu Ser Ala Pro Pro Pro Asp Leu Gly Ala Gln Arg Arg His Gly Trp
35 40 45

Asn Pro Lys Asp Cys Ser Glu Asn Ile Glu Val Lys Glu Gly Gly Leu
50 55 60

Tyr Phe Glu Arg Arg Pro Val Ala Gln Ser Thr Asp Gly Ala Arg Gly
65 70 75 80

Lys Arg Gly Tyr Ser Arg Gly Leu His Ala Trp Glu Ile Ser Trp Pro
85 90 95

Leu Glu Gln Arg Gly Thr His Ala Val Val Gly Val Ala Thr Ala Leu
100 105 110

Ala Pro Leu Gln Thr Asp His Tyr Ala Ala Leu Leu Gly Ser Asn Ser
115 120 125

Glu Ser Trp Gly Trp Asp Ile Gly Arg Gly Lys Leu Tyr His Gln Ser
130 135 140

Lys Gly Pro Gly Ala Pro Gln Tyr Pro Ala Gly Thr Gln Gly Glu Gln
145 150 155 160

Leu Glu Val Pro Glu Arg Leu Leu Val Val Leu Asp Met Glu Glu Gly
165 170 175

Thr Leu Gly Tyr Ala Ile Gly Gly Thr Tyr Leu Gly Pro Ala Phe Arg
180 185 190

Gly Leu Lys Gly Arg Thr Leu Tyr Pro Ala Val Ser Ala Val Trp Gly
195 200 205

Gln Cys Gln Val Arg Ile Arg Tyr Leu Gly Glu Arg Arg Ala Glu Pro
210 215 220

His Ser Leu Leu His Leu Ser Arg Leu Cys Val Arg His Asn Leu Gly
225 230 235 240

Asp Thr Arg Leu Gly Gln Val Ser Ala Leu Pro Leu Pro Pro Ala Met
245 250 255

Lys Arg Tyr Leu Leu Tyr Gln
260
260 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:
AGCTAGATCT < rGACCCTACA ATCGCAGC 

(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:



AGCTAGATCT GGCATCCTAC TCGAGGGGCC AGCTGG 



